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Employment > Wages, and Earnings of Hlspanlcs In the Federal 
an d Non-Federal Sectors; Methodological Issues 
and Their Empirical Consequences 

A major reason for studying employment and earnings differences by 
race and ethnicity Is to determine what such differences Imply both about 
potential employer discrimination and other sources of economic disadvan- 
tage resulting from race or ethnic origin. Much domestic policy Is con- 
cerned with such questions, and Information about the extent to which low 
economic status Is related to employer discrimination or to other factors 
may tiave Important implications for the allocation of resources to dif- 
ferent domestic social programs such as antldlscrlmln&tlon efforts, man- 
power training, and education programs. 1 

The results of statistical analyses of black/white and male/female 
wage and earnings differentials generally reveal that (1) on average, 
black and female wages and earnings are substantially below white male 
wages and earnings, and (2) even after adjustment for productivity- 
related factors such as schooling and labor force experience, the 
adjusted average level of black and female wages and earnings remains 
below the adjusted average level of i#hlte male vages and earnings. The 
difference between the adjusted average earnings or wages of blacks and 
of women and the adjusted average earnings or wages of white men is 
often called "labor market discrimination- to distinguish it from the 
differences in average earnings and wages that result from different 
levels of the productivity variables whose influence has been removed in 
the adjustment. 
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A major stylized fact that aummarlzes most of the empirical evidence 
on wage and earnings differentials Is that both the black/wh:?te and the 
male/female adjusted differentials remain statistically and economically 
Important regardless of the economic model or the statistical technique 
used to analyze the data. Specifically, black/white and male/female 
""labor market discrimination** have- not been fully explained by either 
structural economic theories or statistical adjustments designed to eli- 
minate a plethora of potential biases* In this paper we show that this 
stylized finding does not apply to Blspanlc/Anglo wage and earnings dlf*-^ 
ferentials. Rather, on the whole, Blspanic/Anglo wage and earnings dlf*' 
ferences can generally be explained by human capital differences, self- 
selection biases, and statistical biases arising from Imperfect measure- 
ment of the human capital differences* In particular, most of the dif- 
ference between Rispanica and white non-Hlspanica arises from hturnan 
capital differences* A smaller but still important part of the dlf*-* 
ference arises from statistical biases due to measurement problems* 
Correcting for self-selection bias glvAS essentially the same results as 
ordinary regression analysis* 

It is not possible to discu^is literally all analytical and empirical 
questions about the sources of labor market differences in a single 
paper. Accordingly, we have limited the scope of our analyses in order 
to devote proper attention to (and to extend the range of analyses of) a 
number of specific issues* One issue to which we devote special atten- 
tion is employer wage discriminatloa; another is the extent to which 
employers in the federal and non-federal sectors discriminate by race or 
ethnicity in making wage offers 
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Before proceeding, we define a number of concepts that figure 
prominently in what follows. 

By "federal" and "non-federal" employment we mean, respectively, 
employment in the federal government and employment elsewhere in the 
economy* 

By "ethnicity" we mean Hispanic or non-Hispanic ethnic origin, based 
on the self-declared origin of individuals as either Hispanic or not 
Hispanic. We subdivide Kispanics into two groups: those of Puerto 
Rican origin, and other Hispanics. Of course, non-Puerto Rican Hispanics 
are a heterogeneous group, consisting of Cubans, Mexican-Americans, 
Europeans, Central and South Americans and others. Thus, conclusions 
about the Hispanic group refer to the aggregate of such persons and do 
not necessarily apply equally to each group within this overall aggregate. 
"Black" refers to blacks who are not Hispanic. Persons who are neither 
black nor Hispanic are called >hite non-Hispanics" or dimply "whites." 
Note, however, that the group we call whites ircludes a relatively small 
number of Orientals. American Indians, and others who are not necessarily 
Caucasian. 

By "labor force status" we mean the conventional trichotomy used in 
most government surveys modified so as to distinguish between employment 
in the federal sector and employment in the non-federal sector. Thus, in 
our analyses, any individual's labor force status is always one of the 
following mutually exclusive and exhaustive conditions: employed in the 
federal sector, employed in the non-federal sector, unemployed (that is, 
not employed but seeking* employment ) , or not in the labor force. 
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Finally t by ''ethnic wage discrimination'' we laean any difference In 
total compensation — Including both pecuniary and nonpecunlary 
compensation—that Is associated with differences in ethnicity but is not 
associated with differences in productivity. This definition seems to be 
standard (for example, sea Arrow, 1973, p. 4). Our definition emphasizes 
something that, while implicit in most definitions oj| wage discrimina- 
tion, is worth noting explicitly: wage discrimination means differences 
in total compensation, rather than just in pecuniary compensation per se* 
For example, und^r our definition , pay differentials that are purely com~ 
pensatlng or equalizing in nature are not discriminatory even if they are 
associated with ethnicity but not productivity. By the same token, the 
absence of a difference in pecuniary compensation may also entail wage 
discrimination. For example, an employer who offers Hispanic %rorkers the 
same pecuniary pay but less desirable working conditions? than equally 
productive non-Hlspanic mrkers is behaving in a discriminatory manner, 
in our sense of that term. 

This paper is organized as follows. We first present the economic 
theory underlying our statistical models, and then discuss the statisti- 
cal models. We next present a summary of the data used, discuss our 
results regarding ethnic differences in labor force status, and describe 
the direct regression results from the Survey of Income and Education 
data* The reverse regression results from the SIE data follow; we then 
discuss the structural regression results from the same data. The next 
section discusses statistical results on federal compensation derived 
using an alternative data set^ followed by comparison of all the sta- 
tistical results. The final section presents our conclusions. 
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THE THEORETICAL MODEL 

Like most branches of economics, labor economics Is concerned with 
the analysis of supply and demand. As an actual oi: potential employee, 
the Individual Is chiefly concerned with the labor supply decision: he 
must decide how much to work and the sector In which to wrk subject to 
the constraints he faces. Thus, the Individual is a constrained utlllty- 
maxlmlzer. In the neoclassical sense: he selects the combination of work 
hours, leisure hours, and job characteristics (Including both pecuniary 
and nonpecunlary compensation) that brings the highest possible level of 
happiness consistent with the constraints* Sometimes this maximum 
entails not working at all-*-for example, individuals who do not succeed 
in obtaining a job offer over a given period obviously will not be able 
to work, and other individuals may find that being in school or retire- 
ment is more desirable than employment~in which case the individual 
is either unemployed or not in the labor force. Since the individual 
maximizes subject to constraints, i'c makes sense to say that choices are 
voluntary only if one adds that they are made subject to whatever 
constraints exist. 

While individuals, considered as agents in the labor market, are con- 
cerned %d.th the labor supply decision, the major concern of the firm, as 
an actual or potential employer, is the labor demand decision. The firm 
must decide how high a wage it is willing to offer and what types of jobs 
it requires. Faced %rtth a competitive market for hiring employees, firms 
du not offer more than is necessary to attract proper ^«mployees nor offer 
less than is necessary to fill all po^iitions. 
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Firms may be viewed as continually making job offers, consisting of 
pecuniary compensation and a package of job characteristics %rtilch. In 
effect, constitute nonpecunlary compensation* Individuals may be viewed 
as continually seeking job offers and accepting or rejecting them. Whc^t 
Is observed In a collection of data — for esample, a sample survey — Is the 
outcome of this job offer and job acceptance (or job rejection) process* 
The observed wage and employment outcome Is the result of the process » 
not the process Itself* For example, the fact that a given person 
selects a job In the federal sector over a job elsewhere la correctly 
called endogenous both to the Individual's labor supply decision and to 
the labor demand decisions of employers. 

An individual's sector of '<smployment is at least partly a result of 
an economic decision by the individual about r^hlch job to accept (and 
about whether he will work at all). Each employer assesses the potential 
productivity of prospective emloyees by analyzing the skills they have to 
offer in light of the skills it needs. The employer offers prospective 
workers a package of pecuniary pay and other job characteristics Intended 
to be attractive to them. At the same time, an individual wt\o gets one 
or more offers decides whether to accept one (and, if so, which) or to 
reject all offers. After the decision, an outside analyst observes the 
resulting employment and unemployment. Observed differences in wages, 
job characteristics, or other outcomes (e.g., concentration of persons in 
a particular racial group in a particular sector) are all results of this 
process. 

Since firms seek to maximize profits and understand that workers seek 
to maximize utility, firms will, on average, offer job packages con*- 
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sis ting of pecuniary pay and working conditions that will fill the 
available positions at minimum cost^ A firm whose offers are un- 
necessarily attractive will be flooded with applicants* It, and any com- 
peting enterprise, then knows that It can reduce the generosity of Its 
offers, broadly defined, and still attract adequate numbers of appli- 
cants ♦ Subject to some Important qualifications to be noted below, the 
utility associated with a given job offer will then fall to the minimum 
level required to attract the number of workers the firm wants ♦ In this 
way, then, firms rely on the nature of utility-maximizing behavior of 
individuals and on the nature of a competitive market to bring labor 
supply and labor demand into balance* In all cases » individuals decide 
which oi the options available to them is best, subject to the 
constraints they face* 

Of course, employers may sometimes decide, as a matter of conscious 
policy, to operate out of equilibrium, at least in the senae of an imbal- 
ance between the number of persona willing to work for the employer at 
the current level of generosity of the employer's job package (supply) 
and the number of positions the employer wants to fill (demand) • For 
example, the federal sector may continually and deliberately make job 
offers with compensation in excess of the minimum necessary to fill the 
number of positions it wants to fill. This will re'sult In a waiting 
list, or queue, for federal jobs. When such a queue exists, the various 
jobs available need to be allocated or rationed out among the applicants 
according to some method, formal or informal. For federal government 
employment, one such method of allocation is political— some of the 
available jobs may be allocated through a process of explicit or implicit 
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payoffs. In this situation, different groups in the population have an 
Incentive to compete for the political clout necessary for influence over 
the allocation process. The resources spent coTjipeting for such clout 
eventually bring the system back into equilibrium. If a federal job 
offers a premium over the minimum amount that the individual would 
require in order to be willing to accept it, then the individual will be 
willing to spend resources up to the amount of that premium to get enough 
clout to be offered that job. 

Political allocation may help explain why the federal government can 
make better job offers and have higher minority employment relative to 
total employment than other employers. This higher relative minority 
employment may be in regions where minority political clout is higher. 
For example, minorities may have political clout in regions where 
minority population proportions are higher than they are in the country 
as a whole* This implies that measures of local population proportions 
for minorities may be relevant to analyses of federal employment* 

Of course, non^federal employers, including employers in the private 
sector, may also— like the federal sector — make wage offers in excess of 
the minimum necessary to fill the number of positions they want to fill. 
Marginal private sector employers cannot do so because their profits 
would be driven below the minimum required for survival. Intramargiaal 
private sector employers may do so if they choose. For example, a pri- 
vate sector employer with access to superior production technology will 
be more profitable than average; while this greater potential profitabi- 
lity may accrue to shareholders, it may instead take the form of wage 
offers to some groups that exceed the minimum required to fill the jobs 
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the firm wants to fill. Similarly, a private sector employer may make 
unnecessarily high or excessive offers as a resale of a collective 
bargaining agreement. In cases such as these, as in our previous 
discussion of job allocation through political clout, there will be a 
disequilibrium in the sense that, at the prevailing wage offer, defined 
broadly so as to include nonpecuniary as well as pecuniary rexfards, 
supply will exceed demand. This will induce adjustments that will even- 
tually bring the market back into equilibrium; as before, such adjust- 
ments involve expenditures of resources up to the amount of the premium 
implicit in the employer's offer. In some cases, such expenditures are 
implicit and occur through queueing. In other cases, such expenditures 
are explicit. In still other cases, supply and demand are equated 
through a rationing mechanism that has little to do with productivity 
considerations such a^ when the employer makes offers baaed on factors 
like race rather than on the basis of productivity. 

The labor market, then, settles into an equilibrivjn in %Aich the 
observed distribution of wages and the observed sectoral composition of 
employment are the result of demand and supply decisions* In what 
follows, we are concerned in general terms with intrasectcral differen- 
tials in employment and wage rates by ethnicity, %rith special reference 
CO Puerto Ricans. To clarify the nature of some of the issues in which 
we are particularly Interested, consider the following two questions: 

Question 1: If one were to take a randomly selected group of indivi- 
duals from the population of a given ethnic group and change their 
ethnicity to non-Hispanic (in the case of Hispacics) or to Hispanic 
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(In the case of iion«*Hispauics) ji while kaeplng all of their measured 
and unmeasured productivity-related characteristics the same, then 
would the average of the wage offers made to such persons in a given 
sector differ from the offers that such employers vrould make if they 
knew the actual ethnicity of these individuals! and i£ so, by how 
much? 

Question 2: If one were tc take all the. individtials in a given eth- 
nic group who are employed in a given sector and change their eth* 
nicity to non-Hispanic (in the case of Hlspanics) or to Bispanic (in 
the case of non-Hispanics) , while keeping all of their measured 
productivity-related characteristics the same, then would the average 
of their wages computed on the assumption that they were non-Hispanic 
(in the case of Bispanics) or Hispanic (in the case of non**Hi8panica) 
differ from the actual average of their wages, and if sc, by how 
mvxch? 

The answers to these two questions need not be identical* Both questions 
are of interest for most discussions of employer discrimination in the 
labor market* However, as we emphasize below, a particular statis^tical 
technique may provide a satisfactory answer to one of these questions 
without yielding any direct or useful evidence on the other* 

STATISTICAL MODELS 



Direct Wage Regression 

The vast majority of studies of wage differentials by race, eth- 
nicity, or sex rely on the methodology of direct wage regression* Under 
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this procedure, one fits an earnings function — with a measure of pay such 
as earnings or wages as the dependent variable, and with measures of 
productivity-related characteristics and hypothetically irrelevant 
characteristics (sex, race) as independent variables — by applying least 
squares to data on individuals actually employed in some sector of 
interest. In some cases, as in Mincer's (1974) seminal work, sector 
means all employed persons. In other cases, sector refers to a single 
employer, as in the studies by Smith (1977), Malkiel and Malkiel (1973), 
Oaxaca (1976), Ehrenberg (1979), Osterman (1979), and many others. 
Regardless of how sector is defined, however, all such studies are 
investigating wages given that the individuals in the analysis are all in 
the sector being studied and have both received and accepted an offer 
from that sector. 

It is important to understand what kind of evidence about the source 
and magnitude of wage and earnings differentials is contained in direct 
wage regression results. While direct wage regression may provide 
useful information on some questions, it may provide little or no direct 
evidence on others. Direct wage regressions analyze wage offers that 
have been received and accepted. Thus, while it appears that results 
derived from direct wage regressions may be quite useful for answering 
what we have called Question 2^ they may be much less useful for 
answering what we h^ve called Question 1. 

At the statistical level, it is important to note that, considered 
only in terms of questions on which it can reasonably be expected to pro- 
vide useful information, direct wage regression may provide evidence that 
is misleading-'^in particular, estimates that may be biased or incon- 
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sistenty In a statistical sense* Such bias or Inconsistency can arise 
due either to exclusion of relevant variables or to Inclusion of 
Inappropriate variables* Inclusion of Inappropriate varlables*~more 
generally 9 endogenous varlables'-'^such as occupation may bias direct wage 
regression results* Endogenous variables such as occupational status are 
dependent variables that, along with pay» are simply different aspects of 
the outcome of the Interaction between supply and demand* Treating such 
variables as Independent variables In a direct wage regression confuses 
cause and effect In a fundamental way* 

Exclusion of relevant variables may also bias direct wage regression 
results* For example » prior occupational status may be regarded as a 
measure of the quality of one's work experience prior to becoming 
employed by one's present employer* It Is therefore a productivity- 
related characteristic and> by definition » it is exogenous to the beha- 
vior of one's present employer* Omission of a potentially important pro- 
ductivity indicator of this kind may entail bias or inconsistency in the 
estimates of direct wage regression parameters* 

The problem of omi^ted-variable bias has sometimes been misin- 
terpreted or misunderstood » however* In particular » the fact that an 
omitted variable (e*g*» prior work history or prior occupational status) 
is correlated both with the dependent variable and with an Included inde-* 
pendent variable does not mean that omission of the variable leads to 
bias in the coefficient of any particular independent variable Included 
in the regression* Rather » a coefficient will be biased only if the 
omitted variable is correlated with the dependent variable and \t±th the 
particularly Independent variable* at the margin, l*e*j when all other 
independent variables are held constant* Thus» for example, in order to 
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maintain that omission of prior occupational status will bias the coef- 
ficient on an ethnicity indicator variable, it in neither necessary nor 
sufficient to show that persons in different ethnic groups differ In 
terms of prior occupational status or that prior occupational status is 
associated with pay. Rather, one must show that persons in different 
ethnic groups with the same values for the included variables — age, edu- 
cational attainment, and the like — nevertheless differ in terms of prioi 
occupational status. Thus, the claim that the omission of variables thi 
are plausibly associated with pay even at the margin inevitably biases 
the coefficient on an ethnicity variable in a direct earnings regressior 
is not persuasive, even when there is reason to believe that persons in 
different ethnic groups differ in terms of such relevant omitted 
variables. 

A different but related bias is induced by errors of laeasurement in 
the included variables. It would be surprising if such variables were 
always perfect surrogate or proxy measures of productivity, and it is 
possible that such variables measure actual or expected productivity wit 
error. In this case the coefficients in a direct wage regression may be 
subject to what Roberts (1979, 1981) has called underadjustment bias, A 
statistical procedure used to address this problem is called reverse 
regression^ 

Reverse Wage Regression 

The general phenomenon of measurement error bias in regression 
models h^s received attention for many years, and is a standard topic in 
many econometrics texts (e.g., Kmenta, 1971, pp. 307-322; Maddala, 1977, 
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pp« 292-305) • The problem of measurement error bias in direct wage 
regression, however, has received relatively little attention; most work 
on this subject is quite recent (e.g., Welch, 1973; Hashimoto and Kochin, 
1979; Roberts, 1979, 1980, 1981; Kamalich and Polachek, 1982). Our 
discussion of measurement error bias in direct wage regression and the 
conditions tmder which reverse wage regression may avoid such bias will 
fccu3 on the bivariate case: the relationship between pay and a single 
productivity-relafed characteristic « Either variable may be measured 
with error. (The analysis of the theory of reverse wage regression in the 
multivariate case Involving the relationship between pay and a vector of 
productivity-related characteristics is much less tractable.) 

Assume that the first two moments of the random variables y*, p*, 
e*, and e^ are given by 
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where y* is the appropriate pay v&riable, measured perfectly; p* is the 
productivity index, measured perfectly; e^ is the measurement error in 
the pay variable; acd e^ is the measurement error in the productivity 
variable. The observable pay> y, and observable productivity ^ p, are 
defined as: 

(2a) y « y* + ej 

(2b) p - p* + ej. 
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Accordingly, the first two moments of [y, p] ave given by 
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The system described by equations (2)- (3) is a standard bivariate 
measurement error niodel. True pay, y*, and true productivity, p*, are 
subject to tneasurement errors e* and e*, respectively, which are 
assumed uncorrelated with each other and uncorrelated with the other 
variables in the true system. Since the measuremeut errors have zero 
expectation, the true variables, y* and p*, have the same expected values 
as the laeasured proxies, y and p, respectively. Since the measurement 
errors are uncorrelated with any other variables in the system, the 
measured proxies have the same covariance as the true variables. 
However, the variance of each measured variable exceeds the variance of 
its true counterpart by the variance of the measurement error. 

We consider next the regression relationships connecting the true 
variables and the proxy variables. By definition, the regression of y* 
on p* can be decomposed into the conditional expectation of given p* 
and an expectation error Which la uncorrelated with the conditional 
expectation • We will asstime that the conditional expectations are linear 
in the conditioning variables. In addition, assume that the mean vector 

and the system covariance matrix ^ are different for each race/ethnic 
group i, 1 ■ Hispanic, white non-Hispanic, and black non-Hispanic. For 
each race/ethnic group 1, i:hen, the regression relationships connecting 
the true variables are giveii^by 
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(4a) y* - E[y* | p*]i + r\* 

- a* + b* p* + nj 
(4b) p* - E[p* \ y*]i n| 

- o* + e* y* + n|, 

• . 

where nj and are the errors of the conditional expectations and a^, 
bi, a^^, and 0i are the parameters of the linear functional form for the 
conditional expectations. When the true syatem is Multivariate Normal or 
the system is estimated by least squares using the true variables, the 
conditional expectation parameters are the following functions of the 
underlying system parameters: 

(5a) ^1^^ a* . Uyi - b* ppi 

u>22i ^ 

(5b) a* . upi - 6* My^. 

t^lli ^ 

When the true model is Multivariate Normal, these relationships hold 
exactly. When the true model is only specified up to its firs): two 
moments, as in equation (1), the relationships in (5) hold as the proba- 
bility limits of the least squares estimators of the theoretical para- 
meters when the true variables are used in the analysis. 

Of course, only y and p are directly observable. Consequently, we 
must know the regression relationship connecting these variables in order 
to state the implications of the measurement error problem for the dis- 
crimination analysis of interest. The regression of y on p is defined as 
the conditional expectation of y given p. Once again, by the assumption 
of linear conditional expectations, the regression relationships con* 
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necting the observable variables for each race/ethnic group i are given 
by 

(6a) y E[y* I + e*]i + ni 

■ ai + bi p + m 
(6b) p - E[p* I + ejli + ^2 

• oi + Si y + ni. 

When the true system (1) is Multivariate Normal or i#hen the conditional 
expectations are estimated by least squares using the observed variables, 
the conditional expectation parameters in (6) have the following rela- 
tionship to the theoretical parameters of the underlying systems 

(7a) bi ^I2i ai - Uy± - bi Up± 

W22i + «*>44i 

(7b) - '""^ oii - Mpi - Pi yyi. 

When the true tnodel is ifeiltivarlate Normal, these relationships hold 
exactly. When the true tiodel is only specified up to its first two 
moments, as in equation (1), the relationships in (7) hold as the proba- 
bility limits of the least squares estimators of the theoretical para- 
meters when the observed variables «re used instead of the true variables. 
Notice that the presence of measurement errors and ej causes the 
theoretical regression parameters in equations (5) — the starred values — 
to deviate from the theoretical regression parameters in equation (7) — 
the unstarred values. Technically, the symmetric raeasurement error 
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model has the property that the least squares estimators for the 
regression parameters a^, b^,* a^, and 3i are Inconsistent estimators of 
the regression parameters aj, bj, aj, and connecting the true 
variables. However, It Is straightforward to verify that the conditional 
expectation of the proxy pay variable given the true value of the produc- 
tivity variable Is Identical to the conditional expectation in (4a). 
Similarly, the conditional expectation of the proxy productivity variable 
given the true pay variable is identical to the conditional expectation 
in (4b). 

The Inconsistency in the estimators based on the observed variables 
is at the heart of the criticisms leveled by Hashimoto and Kochin (1979) 
and Roberts (1979, 1981) against the direct regression methodology in 
statistical discrimination analyses. Direct regression is identical to 
least squares estimation of a^ and b^. These estimators are inconsistent 
for the theoretical quantities a* and b* (or and Hj). The effect of 
the inconsistency on the potential inference of statistical discrimina- 
tion based on the direct regression estimates can be seen by considering 
the case in %ihlch each race/ethnic group has the same theoretical values 
of a* and b*. Then, the theoretical average difference in observed 
pay between a member of race/ethnic group i and a member of gi.oup j, 
conditional on the same tnie value of productivity, p*, is given by 

(8) E[yi I p*] - E[yj I p*] - a* + b* p* - (a* + b* p*) - 0, 

since, by hypothesis, a* « a* and b* « b*. However, if the least squares 
estimates of a^ and b^ are used, the estimated difference in pay between 
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a member of race/ethnic group 1 and a member of group j, conditional on 
the same value of observed productivity, p, is given by 

(9) E[yi I p] - E[yj I p] - ai + bi p - (a;, + bj p) 

- a* - a* + (b* - b*) p + b* "^^^ ypi - b* "^^J yp 

U22i + "44i "22j + W44j 

- (ypi - ypj). 
u>22 ^44 

since a* « a* and b* - b*, by hypothesis. Notice that the expression In 
(9) Is not necessarily zero unless ■ ypj — that Is, unless the average 
observed productivity Index Is the same for both groups. Normally, a 
test of the hypothesis of equal theoretical coefficients In the direct 
regression Is considered a basis for an Inference of statistical dlscrl- 
;^lnatlon. Apparently, this test may support an Inference of discrimina- 
tion even though the theoretical coefficients of Interest are equal when 
productivity is measured with error and the groups have different average 
values of the productivity prc^y. 

The analysis is symmetric in its implications for the reverse 
regression methodology. The least squares estimators of and 3i are 
inconsistent for the theoretical parameters a* and Reverse regres- 

sion is identical to least squares estimation cf and 3i. The effect 
of the inconsistency on the potential inference of discrimination based 
on the reverse regression estimates can be seen by considering the case 
in which each race/ethnic group has the same theoretical values of 
a* and 3f. Then, the theoretical average difference in observed 



21 



90 



productivity between a member of race/ethnic group i and a member of 
group J, conditional on the same true value of pay, y*, is given by 

(10) E[pi I y*] - E[pj I y*] • a* -h y* - (a* + S* y*) - 0, 

since, by hypothesis, a* ■ a* and SJ ■ However, if the least squares 

estimates of a^^ and 3i are used, the estimated difference in pay between a 
member of the race/ethnic group i and a member of group j, conditional on 
th(t same value of observed productivity, p. Is given by 

(11) EtPi I y] - E[pj I y] - + y* - (aj+ y*) 

- aj - a* + (&* - S*) y + e* "^^^^ Uyi - S* ^^^J 

^11 ^33 

since a* « a* and ■ 3^, by hypothesis. As we noted for expression 
(9), the mean difference in equation (11) is not necessarily zero unless 
^yi ■ Uyj~that is, unless the average observed pay is the same for both 
groups* Apparently, the reverse regression also may support an inference 
of statistical discrimination even though the theoretical coefficients of 
interest are equal* 

Although equations (9) and (11) are symmetric in their Implications 
for the type of inconsistency induced by least squares analysis of the 
system (1) when only the system (2) is observed, the two inconsistencies 
lead to quite differ^mt errors in a statistical discrimination analysis* 
In general, the covariance between pay and productivity is positive 
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(0)12 > 0), Therefore, the estimated regression slope parameter Is 
expected to be positive whether one estimates b*, g*, b, or 6. 
Consequently, the sign of the Inconsistency depends on the s;lgn of the 
difference In the mean values of productivity or pay for each race/ethnic 
group. If ethnic group 1 has a higher value of the observed productivity 
Index than ethnic group j, then equation (9) Implies that direct 
regression analysis of the observable variables y and p will be biased In 
the direction of finding discrimination favoring group 1 even when all 
coefficients of Interest are equal. However, if race/ethnic group 1 has 
a higher mean value of observed pay than race/ethnic group j, then 
equation (11) implies that reverse regression analysis of the observable 
variables will be biased in th^ direction of finding discrimination 
favoring group j even when all coefficients of interest are equal. 

Roberts (1981) has called this phenomenon the conflict between two 
potential deflnltlona of statistical discrimination. Under his first 
definition, differences in true pay, y*, given the same values of true 
productivity, p*, are evidence of statistical discrimination: that is, a 
racial/ethnic group is discriminated against if it has lower expected 
true pay for a given level of true productivity. As Roberts notes, and 
equation (9) shows, direct regression estimation of the conditional 
expectation of observed y given observed p may give spurious evidence of 
statistical discrimination in the case where one group simply has a 
higher average value of the productivity proxy p than the other. Under 
Roberts* second definition of statistical discrimination, differences in 
true productivity, p*, given the same values of true pay, y* are evidence 
of discrimination: that is, a racial/ethnic group is discriminated 
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against if it has a higher expected true productivity for a given level 
or true pay. As Roberts notes, and equation (11) shows, reverse 
regression estimation of the conditional expectation of observed p given 
observed y may also give spurious evidence of statistical discrimination 
in the case where one group simply has higher average measured pay y than 
the other group. In principle, however, the errors involved in xising 
direct or reverse regression are in the opposite direction. That is, if 
the observed average pay of group i is greater than the observed average 
pay of group j, then nhe observed average productivity of group i is very 
likely to be higher than the observed averag«£ productivity of group J. 
Under these conditions, direct regression analysis of the proxy variables 
y and p may lead to an inference of discrimination against group j %^ile 
reverse regression analysis of the same proxy data 'aay lead to an 
inference of discrimination against group 1. 

The direct and reverse conditional expectation definitions of sta- 
tistical discrimination are not actually different. When applied to the 
true variables y* and p*, either definition of discrimination leads to 
the same implications for the structural parameters and as equations 
(8) and (10) show. In general, true pay cannot be measured exactly since 
the appropriate measure would include current compensation, fringe bene- 
fits* the monetary value of future promotion possibilities, future bene- 
fits, and on-the-job amenities Similarly, true productivity cannot be 
measured exactly since the true index depends on schooling, types and 
quantities ol previous experience, and various other factors that may be 
difficult to quantify. The importance of the analysis of direct and 
reverse regression methods for estimating thci parameters underlying 
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either definition of statistical discrimination Is that, under typical 
conditions, the two statistical methods vlll result In estimates 
that bound the actual magnitude of discrimination. (However « as noted 
below, a potentli^l problem Tflth either direct or reverse methodology Is 
the implicit assumption that. If the structure In equation (1) differs 
across race/ethnic groups in such a way thit either equation (8) or (10) 
Is not zero, then such structural differences can erroneously be 
Interpreted as differences in the behavioral equations governing the 
employment practices of the employer or sector being analyzed*) 

We have derived a version of the reverse %fage regression method fsor 
use in analyses comparable to tVie direct regression models. The proce- 
dure Involves two steps* In the first or ''direct'* stage, we compute an 
underlying direct regression using a randomly selected half of the white 
non-Hispanic observations available to us* We use only half of the 
available observations to fit the direct regression because these esti- 
mated coefficients id.ll be used to form a productivity index for the 
remaining half of the white non-Blspanics and all the black and Hispanic 
observations* (Splitting the sample avoids Inducing spurious correla- 
tion between the computed productivity index and the wage rates in the 
reverse regressions*) The direct re^^ressions used in the first stage 
involve all the productivity indicators used in the direct regression 
except, of course,* the ethnicity indicators and interactions involving 
these indicators. 

In the second or '^reverse'* stage i we tise the conventional wage or 
earnings function coefficient estimates from the direct stage to compute 
predicted wages or earnings y for the remaining observations. We treat 
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this constructed variable 7 as ? proxy measure of productivity. 
Accordingly, y becomes the dependent variable in cur second-*stage reverse 
wage regression. We compute 

(12) y - a + bo'd^+ bl 7 + n» 

where ^ is a vector of race and ethnicity indicators, y is a measure of 
pay (e.g., the logarithm of the hourly wage), and n is the regraasion 
error term. Thus y is a linear function of y (and 

Structural Wage Regression 

Both direct and reverse wage regression are concerned %d.th con- 
ditional wage relationshipa. Such techniques are therefore directly con- 
cerned %d.th what we have called Question 2~identifying the within*-sector 
differences in wages and earnings for di.i'fereat race/ethnic groups. 
However, th^y do not:> in general, estimate che parameters governing ^he 
structure of the underlying process of supply and demand that generates 
wage offers; rather, they constitute analyses of the outcome of that pro- 
cess. Neither direct nor reverse wage regression addresses what we have 
called Question 1 — identifying ':he across-sector differences in wages a^d 
earnings opportunities for different race/ethnic groups. 

In order to obtain answers to Question 1, it is necessary to address 
-directly the question of the determinants of wage offers. Unfortunately, 
most data sets, particularly survey data sets» contain information on 
only a subset of all wage offers — namely, the ones that have been both 
received and accepted. In particular, in terms of our federal/ 
non-federal sector dichotomy, most cross-sectional survey data on any 
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given individual contain information on only one offer (from either the 
federal or the non-federal sector) for employed persons, and do not con- 
tain information on any offer, from either sector, for persons who are 
unemployed or not in the labor force* 

Such data are said to be censored, in the sense that the investigator 
does not know the values of certain variables of interest: in the pre- 
sent case, he does not know the values of the federal sector offers 
available to persons working in the non-federal sector or the values 
of non-federal sector offers available to persons working in the 
federal sector;;moreover , he does not know the values of the offers from 
either sector that are available to persons who are unemployed or not in 
the labor force. Restricting one's analysis to a given sector aggravates 
the problem: intrasectoral data are truncated, in the sense that a 
sample consisting exclusively of intrasectoral data is one from which 
data on persons outside the sector being analyzed have been discarded* 

To ignore this truncation completely, as in an intrasectoral direct 
or reverse wage regression analysis, may subject a study to sample selec- 
tion bias, at least insofar as answers to Question I are concerned (see 
Heckman, 1979; Beckman, Klllingsworth , and MaCurdy, 1981} • Sample 
selection bias may arise in such a study because the data to be used con- 
tain only observations on persons who have received and accepted an 
offer from the sector in question* For example, the observations con- 
tained in data for a given sector are in part self -selected, in the sense 
that, having received an offer from employers in that sector, the persons 
observed in the data for that sector have all selected themselves into 
the sample to be analyzed* Application of direct or reverse wage 
regression to a self-selected sample of this kind may not yield con- 
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slatent estimates of the parameters of the employer's wage offer func- 
tion. More generally, a sample of this kind has a sampling distribution 
determined by both the survey design and the respondent in the sense that 
it consists of persons who have accepted offers. Thi3 makes it not only 
a self-selected sample, in the sense us^d above, but also a ''selected 
sample" in the sense that such persons must first have received offers 
from, and thus must have been selected by, employers. 

This suggests that one way to avoid Che self-selection biases that 
may arise in the context of direct or reverse regression analysis of an 
intrasectoral sample is to derive a model that not only (i) specifies the 
determinants of wage offers— the relation of primary interest — but also 
(ii) describes the process of selection by which the individuals In such 
a sample got into the sample. We start by deriving a model of the selec- 
tion process, and then show how this model may be used in conjunction 
with a model of the determinants of wage offers to obtain consistent 
estimates of the structural wage offer functiou. 

Since the data in the 1976 Survey of Income and Education (SIE), 
which are used in most of t'Ae studies di^acusaed here, refer to a period 
of unusually severe recession, it is worth notli\g that problems asso- 
ciated with selection bias may be more Important in thefi^e data than they 
would be in data that referred to a period when business-cycle conditions 
were more normal. For example, results based on direct (or reverse) wage 
regression analyses of these data might lead to misleading inferences 
about employer offers by virtue of the fact r^?it nonemployment—either 
unemployment or absence from the labor force induced by the 1975-76 
downturn—during 1975-76 was well above the Icjvel observed in more normcil 
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periodij. In constrast, structural regression in effect makes a statisti- 
cal correction for possible biases that might be introduced by such 
pheonomena. Employer wage offers may themselves be affected by cyclical 
downturns such as the one observed during 1975-76, and structural 
regression techniques cannot b?. used to correct for the Impact of a slump 
on wage offers as such. However, structural regression techniques do at 
least permit a correction for the way in which a cyclical ^iownturn— and 
the rise in nonemployment during a downturn— might otherwise confound 
attempts to obtain unbiased measures of the determinants of employer wage 
offers. 

We first derive a model of the way in which Individuals are selected 
into different sectors — i.e., of the determinants of the labor force sta- 
tus of individuals, categorized, as before, as being (i) eaployed in the 
federal sector, (ii) eaployed in the non-federal sector, (iil) 
unemployed, or (iv) not in the labor force. This model may be used to 
compute labor force status probabilities (i.e., the probability that 
li»hor force status will be any one of these four distinct cate[>ories) for 
every individual. These probabilities may then be used to form Instru- 
mental variables for structural %rage regression. 

The basic notion underlying our moJel of labor force status deter- 
mination is the idea of an index function model (see Heckman. 
Killingsworth, and MaCurdy, 1981) or, more or less equlvalently , a 
discrete choice model (see McFadden, 1973, 1975). An index function 
model represents the decision-making procei;s of an agent who is faced 
with the problem of having to choose the best of several alternatives. 
Associated irtth each . alternative is a particular payoff or reward that is 
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represented by the value of an Index • The alternative actually chosen is 
the one with the highest index—that is, the one with the biggest payoff. 

Specifically, recall that we have established four alternative possi- 
bilities for labor force status, and let the utility or payoff U asso- 
ciated with each possibility, or sector, s, be given by 

(13) Ug - V(Wg, qg, x) + v( Wg , qg , x), 

where V, the systematic component of U, is a function of the wage offered 
to the individual by employers in that sector; qg is an index of the 
characteristics associated with that sector (e.g., one^s home or school 
environment, for the **not in the labor force" sector; the work environ- 
ment, for the federal employment sector); is a vector of observed 
characteristics of the individual; and v is an error term (the stochastic 
component of U). Note that no wage is relevant to being in the 
unemployed sector or the "not in the labor force" sector. The individual 
will choose to be in a particular sector s if the utility associated with 
that choice exceeds the utility associated with an/ other choice. For 
example ji the individual will choose the federal sector if and only if 

(14) Uf > Max(On, Ou, U^), 

wliere the f: subscript refers to the federal sector, n refers to the non- 
federal sector, u refers to the unemployment sector, and o refers to the 
"not in the labor force" sector. Expressions similar to (14) define the 
circumstances under which the individual willl choose non-federal 
employment, unemployment, or absence from the labor force. Note that all 
such choices are subject to the values of the wage offers received from 
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the federal and non-federal sectors, Wf and Wn* Thus, as before, choice 
Is subject to constraints, and statements that choice is voluntary make 
sense only if one understands both that such choices are constrained and, 
thus, that the f^ct that such choices are voluntary has no particular 
normative implications. Note also that non-receipt of an offer from the 
federal or non-federal sector may be treated as, and is treated in this 
analysis as, the equivalent of receipt of a very low offer from that sec- 
tor. 

To specify the decisions process (13)-(14) in c manner suitable for 
empirical estimation, let the systematic component V of the utility func- 
tion for sector s (s ■ f, n, u, or o) be given by 

(15) V(Wg, qg, x) - ai(qs) Wg + x;a2(q8)* 

where ai(.) and a2(.) are, respectively, a scalar and a vector function 
of ^8* which vary across sectors because of their dependence on the 
characteristics qg of that sector. Next, assume that the logarithm of 
the (best) wage offer available to the individual from employers in sec- 
tor s (s ■ f or n) is given by 

(16) Wg - zfbg + eg, 

where z is a vector of observed variables that affect the wage offer 
Wg and eg is an error term whose population mean is zero* Substitute 
(16) into (15) and rearrange terms, to obtain 

(17) - z_'y^^ + .-TTj^ + ^ + ^, 



31 



100 



where « ai(qs) 

las " ilCqs) 

V* - eg ai(qs) + v(Wrt, qg, x) , 

which is linear in all observed w^^ables and x^. (Note that some ele- 
ments in £^ may also appear in x^, and vice ^ersa) • 

Finally, let the distribution of the random term In (17) be 
approximately independent Weibull. This means that intersectoral dif-* 

ferences between these errors, v*-v*, v*-v*, vJ-V*, etc., are all 

' f n' f u' f o' 

approximately independent logistic • 

Together with (14), the independent logistic assumption implies that 

(18) Pr{in sector s} - ^^^^^s^ 

exp(V|) + exp(V*) + exp(V*) + exp(VJ) 

for s ■ f, n, u, or o. Thus, (18) gives the probability that an indivi- 
dual will b^* in any given sector s as a logistic function of x_ and 
Note that (18) is therefore a reduced form expression, since it contains 
both supply and demand variables* 

We now consider how to use estimates of parameters governing labor 
force status, i»e., estimates of (18), to obtain estimates of the parame 
ters of the wage offer equation. We refer to this as structural wage 
regressions • 

As noted earlier, we consider two kinds of employment in our 
analyses: federal and non-federal employment. Let N3 be the number of 
persons in sector s; s ■ f or n. Let Wg be the logarithm of the (best) 
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wage offer for work in sector s available to an individual with charac- 
teristics x^, and assume that Wg is given by (16) above. 

Now, (16) is an expression for the wage Wg that the individual will 
receive if he works in sector s and, by assumption, the mean value of 
Wg in the population as a whole, givsn z, is 

(19) E[wg I z) ' z:bs. 

On the other hand, the mean value of Wg, given z, among persons actually 
working in sector t is 

(20) E[W3 \ Zj 8 - t] -^fbt + E[es \ ±, s - t] . 

Note that (19) and (20) are equivalent only if the conditional mean of 
eg is independent of the condition s ■ t, i.e., only if the population 
mean of the error term eg and the mean of eg among persons actually 
employed in sector c are the same. If not, then, in terms of the 
discussion in the previous section, persons, in sector s are a selected 
sample. The sampling distribution of the eg in the data is not the same 
as the distribution of the eg in nature. This is the case in which 
conventional least squares analysis of the regression based on a sample 
restricted to persons actually in sector s will yield biased estimates of 
the parameters of the wage offer function bg. Such a regression in 
effect ignores the second term on the right-hand side of (20), and so 
will suffer from omitted variable bias, where the omitted variable in 
question is the conditional mean of eg. (For further discussion of this 
point, see Heckman, 1979.) 
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To derive an alternative to conventional regression that may be used 
to obtain consistent estimates of the parameters of the wage offer func- 
tion» note that 

(21) E[W3 I ^, s - f ] - 

00 OB 

J, J, iff(wf . x_) p(wf, I z) dwf dwji 

^ f 

/ / iif(wf , Wji, x_) p(wf , 1 _z) dWf dWji 

Where iTgCwf , w^, x) - Pr{in sector s I Wf , w^. x} and p(wf , w^ I - the 
joint density function of Wf, w^ conditional on z. Approximate the 
numerator of (21) with a first order Taylor series around the means of 
Wf and Wq. Approximate the denominator of (21) with the unconditional 
probability of choosing sector a to obtain an ov^srall approximation; 

(22) E[w,3 s - t] . zfbt ^t(r^> i:Wi^> iL) ^ 

~t 

where iTg(wf , w^, x) his been evaluated at mean values of Wf and w^, and 
TTg is the average value of ttq in the population. Note that Tr^ ia the 
probability that an individixal will be in sector s and may be computed 
using estimates of the parameters of (18), while Tg ±s the proportion of 
all persons in sector s« 

Equation (22) suggests an instrumental variable estimator of the 
coefficients 2s the structural wage equation (16). The basis for this 
claim is the form of the approximation to the conditional expectation of 
the wage given the sector of employment in equation (22). This is the 
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approximate regression function for Wg given employment in sector s and 
the exogenous variables Therefore, by construction, the variables on 
the right-hand side of (22) are orthogonal to the error term in the 
Sector-specific wage regression. These right-hand side variables depend 
on an unknown ratio X - irC^'bf, ^zfbn, x)/ Tg, which is che ratio of the 
probability of being employed in sector s evaluated at the mean value of 
the wage in each sector, given z^, to the average probability of being 
employed In sector s. This ratio fluctuates around unity. It is higher 
for individuals with higher than average probabilities of being in sector 
8 and lower for individuals with lower than average probabilities of being 
in sector s. This ratio may be estimated by using as the numerator pfo- 
bability the fitted value of the estimated logit probability developed 
above and using as the denominator probability the sample proportion in 
sector &• 

Having developed an estimator for this ratio, we are faced with a 
choice of strategies for estimating bg. First, we could regress the 
sector-specific wages on the product of ^ and the ratio X. Since the 
ratio X is estimated, this strategy will lead to problems in determining 
the appropriate measure of precision for this estimator. Alternatively, 
one may use X to develop a set of instruments that are correlated with z 
but uncorrelated with the error in the conditional wage expectation given 
2^ and the sector of employment* These instruments are exactly the right- 
hand side of equation (22). The X must still be estimated; however, this 
approach does not lead to problems in estimating standard errors because 
the convergence of the moment matrix of the instruments is guaranteed by 
the consistency of the logit parameter estimates and by the fact that no 
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nonlinear Instruments are used as right-hand side variables In the 
equation being estimated* The estimated residuals may be 
heteroskedastlc; however. In estimation we allow for this possibility* 
Each row of the Instrument matrix Q Is defined as 

(23) Sii ' ^3if 

where Xg^ - i^U^^bf , ^'b^, x^)/ Fg, 1-1, Ng, and Ng - the total 

sample In sector s« To allow for potential mlsspeclf Icatlon of the 
probability-generating process we add a set of Instruments, qj^, defined 
as 

2 

(24) m Xg^. 

The complete Instrument matrix Q, then, consists of Ng rows of 

% ^^]* The bg are estimated using Instrumental variables: 

(25) bg^. [Zs'Q8(Q8'Qs)"lQ8'28]-lZg'Qs(Qs'Qg)-X'w3, 

where Z is the Ng by k matrix of wage equatlou variables, Q is the 
Ng by 7.k matrix of Instruments, and W3 is the Ng by 1 vector of wages 
observed in sector s* The estimator of the asymptotic variance- 
covarlance matrix is 

^2 , . 

(26) Var[^] - Jg [Zg'Qg(Qg'Qg)-lQg'Zg]-l , 

-2 

where Cg is the sum of squared structural residuals divided by the sample 
size Ng 

(27) ag - (w^- Z'bs)'(w^- Z-bs)/Ng- 
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Conceptually, the structural estimator of the parameters relating 
Wq to Is quite different from both the direct and reverse regression 
estljiators of those parameters. If ^ Includes a vector of race/ethnic 
Indicators, say d, then the structural model developed in this paper 
estimates the coefficients on d_ for the population conditional expec- 
tation of Wg given jc_ and not for the subpopulation conditional expec- 
tation of Wg given 2^ and s « t for some sector t. This difference is 
important, since the structural model 2t:trl>utes behr^^ior^^"* f?lgnificance 
to the population conditional expectation and not to the self-selected 
subpopulation conditional expectation. In a direct or reverse r^.,rression 
analysis of race/ethnic pay differences, the conditional ejcpectation of 
pay, given the productivity index £;'bg and given the sector of employment 
s, may differ across groups because of systematic differences in the 
employers' pay practices (the usual assumption in statistical discrimina- 
tion analyses) or because of systematic differences in the workers* pre- 
ferences, as modeled by the sectoral choice model above. In geur^r^l the 
conditional expectation of pay, given £^^bg and sector s, may differ 
across race/ethnic groups because of variation in labor demand (employer 
policies) or labor supply (employee policies). The structural nodel 
developed in this chapter makes assumptions sufficient to identify the 
parameters underlying labor demand (but not labor supply) , permitting 
estimation of the conditional expectation of pay offers given only 2. 

DATA USED IN EMPIRICAL STUDIES 

Most of the data used in the empirical studies described in this 
report are derived from the 1976 Survey of Income and Education (SIE). 
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The SIE was conducted during April-July 1976 by the U.S« Bureau of the 
Census for the U.So Department of Healthy Education and Welfare (now the 
Department of Health and Human Services)^ and was the largest national 
survey since the 1970 Census of Population* Most of the procedures and 
definitions used In the SIE arc Identical to those used In the annual 
Maroh Current Population Survey (CPS), Lat the SIE alao contains 
questions pertaining to Income, education, and language skills that are 
not contained in the CPS. For further description of the SIE, gee U.S.. 
Bureau of the Census (1978). 

We have excluded persons under 21 y^ars old. Among persons 21 years 
old or older In the SIE data base, 8,168 are Hlspanlcs, 19,501 are black 
non-Els panics, and the remainder (246,837) are whites (that Is, persons 
neither Hispanic nor black). Ethnicity Is self -repor ted « Race, however. 
Is determined by Interviewer observation. 

Second, we have excluded persons not residing in the continental 
United States; our data therefore exclude persons residing in Hawaii and 
Alaska, and also, of course, persons living in Puerto Rico. 

The SIE therefore refers to a sample of persona in the country as a 
whole, and geography undoubtedly has major effects on pay through its 
association with such factors as (1) regional cost-of-living differen- 
tials, (11) regional differences in amenities and also, to the extent 
that labor is immobile, (ill) regional differences in factor proportions 
(for example, see Klefer and Smith, 1977). Moreover, there are Important 
regional differences in the location of minority populations and the 
location of various industries, including the federal government. In all 
of our analyses, geography, specifically locatlonal choice, is taken as 
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exogenous* Nevertheless » we have taken several measures to ensure that 
minority groups are compared with nonmlnorlty groups from the same 
geographic region. The sampling design of the SIE oversampled less poou- 
Irted states I meaning that the geographic distribution of eraployment 
opportunities Is not sampled randomly. 

In order to control for the differences In labor demand across 
geographic regions » we have used two sets of geographically matched 
samples in our analyses. The loglt models of the labor force status were 
estimated using samples of blacks and of white non-^Hlspanlcs that were 
geographically matched to our sample of Hlspanlcs* Regression analyses 
were performed on federal and non-federal samples that were geographi- 
cally matched to the federal sample. 

We did this geographic matching by state and by what the SIE calls 
central city code, which categorizes persons according to residence in 
the following way: (1) located in fh.e central city of a Standard 
Metropolltal Statistical Area (SMSA), (2) located in an SMSA but not in a 
central city» (3) located outside an SMSA» and (4) location not disclosed 
(in order to avoid breaching Census regulations governing 
confidentiality)* Relatively small nixmbers of persons » mainly persons 
residing in outlying areas » fall into the last of these four categories* 
Thus» for example > after determining the total number of Hispanlcs living 
in the central city area of the Los Angeles-Long Beach SMSA in 
California^ we randomly selected equal numbers of black non-Hlspanlcs and 
of whites from the total populations of such persons in the same area; 
and similarly for all other areas* The result of this process of 
matching was three samples (of Hispanlcs , black non-Hi s panic 8 > and 
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whites, respectively) with the same sampling probabilities for each state 
and central city code. In addition, we produced two samples (of federal 
and non-federal employees, respectively) with the same sampling probabi- 
lities for each state and central city code. Therefore, five analysis 
samples were produced: three which were geographicaly matched to the 
Hispanic data and two which were geographically matched to the federal 
data. 

For the samples geographically matchad to the Hispanic ssmple from 
the SIE, the sampling probabilities for Kispanics and whites are iden- 
tical for each state and central city code. However, because there were 
not enough black non-Hispanics in the original SIB sample for the West 
and Southwest regions, this group is undersampled for these regions in 
our sample. Ml federal employees in the SIE are included in the federal 
sampl-i. .Q the non-federal sample, whites are exactly matched geographi- 
cally hut Hispanics and black non-Hispanics are oversampled. Since eth- 
nicity and location are always conditioning variables in the analyses 
using the federal and non-federal samples, the oversampling of blacks and 
Hispanics can be expected to reduce sampling error on ethnicity effects 
without Inducing a location bias. 

Since we are not able to observe the actual work experience of the 
individuals in our data, we must use a measure of potential work 
experience (Mincer, 1974) defined as current age less years of schooling 
less 5. The problems associated with this proxy are well known, par- 
ticularly as regards male-female differences in potential vs. actual work 
experience. Accordingly, we think it appropriate in analyzing differen- 
tials in employment status, wages, and earnings to consider men and women 
separately. 
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Annual earnings » as defined In our studies, Is the total amount of 
Income from work received during the year 1975. The hourly wage, as used 
In our studies, Is computed as the ratio of annual earnings to annual 
hours of work, where the latter Is computed as the product of weeks 
worked during the year 1975 and usual hours worked per week during the 
year 1975. Labor force status Is defined according to standard Current 
Population Survey concepts (U.S. Bureau of the Census, 1978) as of the 
week preceding the actual survey date* 

The period 1975*76 was part of an unusually severe recession. This 
may have Implications for the Interpretation of our results. In par- 
ticular, differentials of any kind (skill, racial, etc.) may tend to 
widen during business-cycle slumps and narro^r during booms. To the 
ertent that this is true, the various effects we discuss in this report 
may overstate somewhat the effects that would be observed during more 
normal (less recessionary) times. 

In addition to the SIE we also used the federal government's Central 
Personnel Data File (CPDF)^ The CPDF is a payroll data set based on 
federal personnel files. CPDF data are derived from various federal 
payroll documents and are used by the '3ederal Office of Personnel 
Management and other federal agencies in studying characteristics of the 
fsderal civilian work force, in personnel planning, and in other related 
activities. The CPDF is longitudinal in nature, having begun in 1972 and 
having been updated on an anniial basis elnce that time; thus, it permits 
analyses of several different jears. Finally, since the CPDF covers 
essentially all federal employees. It contains large numbers of Hispanlcs 
as well as large numbers of persons in other racial and ethnic groups. 
(For further details on the CPDF, see Schneider, 1974.) 
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In computing results using the CPDF, we started with samples of 5,000 
Hispanics and 5,000 non-Hispanics , selected randomly from the total CPDF 
populations present in each of the years 1975, 1976, and 1977 • As in our 
work on the SIE data, we then excluded persons who either (i) were not 
living in the continential United States or (ii) were under 21 years old. 
This reduced a given year*? sample by about 12Z to about 8,800 people. 
About 15Z of the persons remaining in any given year's sample after appli- 
cation of this exclusion could noi: ba included in. the regression for that 
year due to missing data (mainly for educational attainment or, to a 
lesser extent, race or sex)o Also, we computed regressions for each year 
separately for each sex. Thus, the total size of the sample used for 
regressions for a given sex for a particular year is between about 2,000 
(in the regressions for women) and about 5,600 (in the regressions for 
men) • 

In order to provide a basis for comparisonn between the various sta- 
tistical procedures described earlier, we estimated a set of different 
wage and earnings models using the same data and definitions. We briefly 
discuss the design of these models* All regression models for wages and 
earnings based on the SIE use the same sets of explanatory variables. 
The regression models for wages and earnings based on the CPDF use dif- 
ferent but similar explanatory variables. Tne logit models for 
employment sector basr;d on the SIE use an abbreviated set of explanatory 
variables. We describe each explanatory variable list in turn. 

The dependent variable for the wage and earnings analyses based on 
the SI^ id either the log of the hourly wage rate or the log of annual 
earnings. Independent vari^^bles capture effects on wages associated with 
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human capital, ethnicity, race, age, geography, and other factors. A 
list of all variables used in the wage and earnings regressions based on 
the SIB data is as follows: 

Dependent Variables 

either the natural logarithm of the hourly wage rate 
or the natural logarithm of annual earnings 

Independent Variables 

Group A variables (ethnicity and race indictors—variant 1): 
1 if Hispanic » 0 otherwise 
1 if black and not Hispanic , 0 otherwise 

Group B variables (ethnicity and race indicators — variant 2): 
1 if Puerto Rlcan, 0 otherwise 
1 if Hispanic but not Puerto Rlcan, 0 otherwise 
1 if black and not Hispanic, 0 otherwise 

Group C variables (human capital, geography^ and other factors): 
ntimber of years of formal education 
1 if graduated from high school, 0 otherwise 
1 if graduated from college, 0 otherwise 
1 if any postgraduate education, 0 otherwise 
1 if currently a full-time student, 0 otherwise 
1 if currently a full-time public school student, 0 otherwise 
number of years of education received outside the U.S. 
1 if had any education outside the U.S., 0 otherwise 
1 if taught in English, 0 if taught in any other language 
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1 if U.S. -born, spoke English as a child, and speaks English now; 

0 otherwise 

1 If not U.S. -bom, 0 otherwise 

number of years lived In U.S. (equal to zero, for persons bom 
In U.S.) 

1 If English not the primary language spoken as a child, 

0 otherwise 

1 If English not the primary language spoken now, 0 otherwise 
1 If English not spoken or understood very well, 0 otherwise 

1 If has any physical condition limiting ability to work, 

0 otherwise 

1 if age is over 30 and under 41, 0 otherwise 
1 if age is over 40 and under 51, 0 otherwise 
1 if age is over SO and under 65, 0 otherwise 
1 if age is over 64, 0 otherwise 

potential experience (age minus years of schooling minus 5) 

square of potential experience 

1 if employed part-time, 0 otherwise 

1 if a veteran, 0 otherwise 

1 if lives in New England area (Maine, New Hampshire, Vermont, 
Massachusetts, Rhode Island, Connecticut), 0 otherwise 

1 if lives in Middle Atlantic area (New York, New Jersey, 
Pennsylvania), 0 otherwise 

1 if lives in East North Central area (Ohio, Indiana, Illinois, 
Michigan, Wisconsin), 0 otherwise 

1 if lives in West North Central area (Minnesota, Iowa, Missouri, 
North Dakota, South Dakota, Nebra&^ka, Kansas), 0 otherwise 

1 if lives in South Atlantic area (Delaware, Maryland, District 
of Columbia, Virginia, West Virginia, North Carolina, South 
Carolina, Georgia, Florida), 0 other^^ise 

1 if lives in East South Central area (K€:ntucky, Tennessee, 
Alabama, Mississippi, Arkansas ^t^uisiana, Oklahoma, Texas), 
0 otherwise 
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1 if lives in Pacific area (Washington, Oregon, or California), 
0 otherwise 

Group D variables (population proportions and interactions): 

proportion of population in area (classified by state, SMSA, and 
central city) that is black non-Hispanic 

proportion of population in area that is Hispanic 

proportion black non-Hispanic in area times years of school 

proportion Hispanic in area times years of school 

proportion black non-Hispanic in area times potential experience 

proportion Hispanic in area times potential experience 

Group E variables (interactions with race, ethnicity indicators): 

Hispanic indicator times years of school 

black non-Hispanic indicator times years of school 

Hispanic indicator times high school graduation indicator 

black non-Hispanic indicator times high school graduation indicator 

Hispanic indicator times college graduation indicator 

black Pon-Hispanic indicator times college graduation indicator 

Hispanic indicator times postgraduate education indicator 

black uon-Hispanic indicator times postgraduate education indicator 

Hispanic indicator times potential experience 

black non-Hispanic indicator times potential experience 

Hispanic indicator times square of potential experience 

black non-Hispanic indicator times square of potential experience 

Group P variables (interactions between race, ethnicity indicators, 
and population proportions): 

black non-Hispanic indicator times percent black non-Hispanic in 
area 

black non-Hispanic indicator times percent black non-Hispanic in 
area times years in school 
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black non-Rlspanlc Indicator times percent black non-Hlspanlc In 
area times potential experience 

Hispanic Indicator times percent Hispanic In area 

Hispanic Indicator times percent Hispanic in area times years in 
school 

Hispanic Indicator times percent Hispanic in area times potential 
experience 

Group A and Group B variables are indicators for minority status* 
Group A identifies Hispanlcs and blacks who are not Hispanics* Group B 
uses the same black non-Hispanic indicator but distinguishes between 
Hispanic subgroups, i.e*, those of Puerto Rican origin and other 
Hispanlcs • 

Group C variables are forms of the basic human capital variables nor-* 
mally found in direct wage regressions. The exact form of these 
variables isi of course, limited by the nature of the data available 
in the SIE. These variables—for education, age, potential work 
experience, and the like — are proxies Intended to capture the employer's 
attempt to estimate the productivity of potential employees. 

Some variables in Group C go beyond the basic proxies used in most 
previous research. Variables for years of education outside the United 
Scates and for not speaking English as one's primary language are 
Intended to capture effects of immigration and language skills that may 
affect earnings (see Chlswlck, 1978, 1980). Indicators of geographic 
location reflect the possible Impact of region (that is, regional price 
differentials, capital-labor ratios, etc.) on job offers. 

Group D variables reflect local Hispanic and black non-Hispanic popu- 
lation proportions. These population proportions are also multiplied by 
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years of school or potential experience in order to capture possible 
Interactions. Group E variables are Interactions between human capital 
variables (schooling and potential experience) and minority status. 
Group F variables are triple-interaction effects, i.e., minority indica- 
tors multiplied both by minority population proportions and by either 
years of school or years of potential experience. 

Since the CPDF is similar to the personnel data files of a single 
employer, the variable list for the regression analyses based on these 
data Includes more detailed information on the individual's work history. 
The variable list does not Include the detailed educational, language, 
and immigrant backgound data found in the SIE. The variables used in the 
regressions based on the CPDF are as follows: 

Dependent Variables 

natural logarithm of annualized salary 

Independent Variables 

Group A (race and ethnicity indicators): 
1 if Hispanic, 0 otherwise 
1 if black, 0 otherwise 

Group B (expanded race and ethnicity indicators): 
1 if Hispanic, 0 otherwise 
1 if black, 0 otherwise 
1 if Oriental, 0 otherwise 
1 if American Indian, 0 otherwise 
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Group C (human capital, geographic location, etcO: 

educational attainment Indicators (1 if possesses the Indicated 
characteristics, 0 otherwise) for each of the following mutually 
exclusive categories: 

completed elementary school, did not complete high school 

has some high school education, but did not complete high school 

has high school diploma or equivalent 

attended terminal occupational training program, but did not 
complete It 

completed terminal occupational training program 

attended less than one year of college 

attended one year of college 

attended two years of college 

has assoclate-ln-arts or equivalent degree 

attended three years of college 

attended four years of college, but did not receive B.A, or 
equivalent degree 

has B«A« or equivalent degree 

has B.A. or equivalent and some post-B.A. training 
has first professional degree (e.g., J.D., M.D.) 

has first professional degree and some post-flrst-professlonal- 
degree training 

has M.A. or equivalent degree 

hiSS M.A. or equivalent and some post-M.A. training 

has a ?.»lxth-year degree (e.g.. Advanced Certificate In 
Education) 

has a sixth-year degree and some post-sixth-year degree training 
has Ph.D. or equivalent degree 

has Ph.D. or equivalent degree and some post-Ph.D. training 
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years since highest degree, for persons with at least a B.A. or 
equivalent (for persons with less than a B.A., this variable is set 
at zero) 

square of years since highest degree 

indicators for field of highest degree, for persons with at least 
a B.A. or equivalent (1 if field of highest degree is the one 
indicated and zero otherwise; set at zero for all persons with less 
than a B.A.), as follows: 

medical doctors (M.D., D.D.So, D.V.M., etc.) 

allied health professions (nursing, therapy, etc.) 

mathematics, architecture, engineering, data processing 

physical or biological sciences 

arts or humanities 

social science&i 

law 

age 

square of age 

years employed in federal government 

square of years employed in federal government 

product of age and years employed in federal government 

1 if has physical or mental disability, 0 otherwise 

indicators for veterans* prefer^inciS (1 if possesses the indicated 
type of veterans* preference, 0 otherwise), as follows: 

five-point veterans* employment preference 

ten-point disability veterans* employment preference 

ten-point compensable veterans* employment preference 

ten-poiat other veterans* employment preference (e.g., spouse, 
survivor) 

indicators for state of residence (1 if lives in a particular 
state, 0 otherwise) for all 48 states in the continental U.S. 
and the District of Coltmbia 
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The Group A and Group B variables differ only In that, In the latter 
group, we distinguish between Orientals and American Indians, on the one 
hand, and all other persons who are neither black nor Hispanic, on the 
other. More or less by definition, this group of all other persons might- 
be called majority white. 

Note that our Group C variables (reflecting human capital, geographic 
location, and the like) are quite similar to the ones used in our SIE 
regression models in some respects, but are rather different in other 
respects*. In particular, the CPDF data permit us to derive educational 
attainment indicators that are more detailed than the ones that can be 
obtained from the SIE data: for example, the latter do not contain any 
measures of the number of years elapsed since highest degree, or of the 
field of the highest degree, while the CPDF data do; and while the SIE 
measures the number of years of school completed, the CPDF data provide 
somewhat more information about the amount and kind of educational 
attainment than the simple amount of Mme spent in school. The CPDF data 
also contain a measur.^ of years of employment in the federal government, 
while the SIE data do not contain any measure of actual work experience, 
even with one's present employer. Of course, on the other hand, the CPDF 
data do not contain measures of some variables of interest that are 
available in the SIE. For example, the CPDF data do no^. contain aviy 
information on language skills and also do not differentiate between race 
or ethnicity. That is, the SIE data classify persons according to both 
race and ethnicity (which, for example, permits one to differentiate be- 
tween black and white Hispanics), while in the CPDF classification scheme 
race and ethnicity are defined in such a way as to make black and 
Hispanic mutually exclusive. 
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We use the variables listed above to form two different regression 
models. The first model uses the simple Group A race-ethnicity Indica- 
tors and the Group C variables, while the second model uses the expanded 
Group 3 race-ethnicity indicators and the Group C variables. Note that 
the first model, comprising Group A and Group C variables, is most com- 
parable to the basic model used in our SIE regressions. 

Because of the problems associated with estimating many parameters in 
logit models we use a smaller set of tha available variables in our ana- 
lysis of labor force status. The variable list for the logit analyses 
based on the SIK data is as follows: 

Dependent Variable 

labor force status, categorized as follows: 
employed in the federal sector 
employed in the non-^federal sector 
unemployed 

not in the labor force 
Independent Variables 

number of years of formal education 

potential experience (• age minus years of schooling minus 5) 
1 if age Is over 30 and under 41, 0 otherwise 
1 if age is over 40 and under 51, 0 otherwise 
1 if age is over 50, 0 otherwise 

niimber of years lived in U.S. (equal to age, for persons born in 
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1 If born in U.S., spoke English as a child, and speaks English now, 

0 otherwise 

1 if married %rith spouse present, 0 otherwise 
number of persons in household 

percent of population in area (classified by state, SMSA, and central 
city) that is Hispanic 

percent of population in area that is black non-Hispanic 

percent Hispanic in area times years of school 

percent black non-Hispanic in area times years of school 

percent Hispanic in area times potential experience 

percent black non-Hispanic in area times potential experience 

I if of Puerto Rican origin, 0 otherwise 

We estimate various logit models, containing alternative combinations 
of these variables, separately for each sex, using separate samples of 
Hispanics, black non-Hispanics , and white (that is, other) non-Hispanics. 
Note that an indicator for Puerto Rican ethnicity cannot be included in 
logit s for samples of black or white non-Hispanics because, by defini- 
tion, this indicator has a value of zero for all such persons. On the 
other hand, wa do include such an indicator in logit s for samples of 
Hispanics in order to distinguish between Puerto Ricans and other 
Hispanics. 

Liu^^OR FORCE STATUS RESULTS 

One of our principal interests in this research is to compare the 
federal and non-federal sectors. The implications of our logit models 
with respect to employment in these two sectors are summarized in Table 
1, which compares the actual and predicted employment sector for each 
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Table 1 



Comparison of Minorities' Predicted Employment Proportions 
(Predictions use logit coefficients from the sample of whites) 



federal E mployees Private Employees 

Actual Predicted X Actual Predicted 2 

^ 2 Diff. J J Diff. 



Men 



Hispanic 


4.53 


3.57 


26.9 


75.82 


75.08 


1.0 


Puerto Rican 


4.58 


2.49 


83.9 


68.64 


77.22 


-11.1 


Hispanic non-Puerto Rican 


4.53 


3.71 


22.1 


76.89 


77.22 


2.8 


Black 


5.07 


2.98 


70.1 


67.62 


73.17 


-7.6 


White 


3.91 






78.41 







Women 



Hispanic 1.65 


1.62 


1.9 


46.61 


47.45 


-i.8 


Puarto Rican 1.35 


1.00 


35.0 


34.28 


51.64 


33.6 


Hispanic non-Puerto Rican 1.69 


1.71 


-1.2 


48.43 


47.06 


2.9 


Black 3.53 


1.29 


173.6 


50.96 


52.08 


-2.2 


White 1.60 






43.95 






Note: Data base is the Survey of 
analysis . 


loccme and 


Education; 


see text for 


description 


of 
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race/ethnic group. The comparison Is based on the characteristics of 
each individual in the sample (regardless of actual sector). A predicted 
probability vas generated using the estimated logit coefficients for 
labor force status from the white sample. All comparisons in this table 
concerning under- or overrepresentation are made relative to white 
non-Hispanics (men, in the case of the other male ethnic groups; or 
women, in the case of the other female ethnic groups) I A positive entry 
for a given sector in the column headed *Z Diff.- indicates that the 
group in question is overrepresented in that sector relative to white 
non-Hispanica of the same sex with the same educational attainment, age, 
etc.; a negative entry indicates undc^rrepreaentation. 

The main implications of Table 1 may be* summarized as follows. 
First, virtually all minority ethnic groups (that is, groups other than 
white non-Hispanics) are substantially overrepresented in federal 
employment relative to white non-Hispanics. (The only exceptions to this 
generalization are Hispanic non-Puerto Mean women, who are slightly 
underrepresented in federal employment, and Hispanic women generally, 
who are only slightly overrepresented in federal employment, on average.) 
However, note that such overrepresentation in federal employment is only 
a small proportion of any given groupie population. (For example, Table 
1 indicates that men of Puerto Rican origin are overrepresented in 
federal employment in the sense that the actual proportion of such men in 
federal employment is 4.58Z, as opposed to the 2.49% that would be 
expected if this group acted and were treated in regard to labor force 
status as white men with identical schooling, age, etc.) In this sense, 
an end to such overrepresentation would not involve the reallocation of 
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a large r umber of persons. Second, Puerto Ricans of either sex are also 
substantially underrepresented in non-federal employment relative to com- 
parable white non-Hispanics. Third, bl.^ck non-Hispanic males are also 
underrepresented in non-federal employment. Recall that these differen- 
ces in labor force status cannot be attributed exclusively to either 
supply or demand factors (e.g., to individual tastes or to employer 
discrimination) since the estimated version of the logit model does not 
identify either of these two behavioral relationships separately 

To complement Table 1, we present in Table 2 a summary of the in^v^ica- 
tions of our logit results concerning the relation between ethnicity and 
nonemployT3ient , i.e., either unemployment or absence from the labor force. 
This shows that both men and women in each of the minority ethnic groups 
considered in our analyses are overrepresented among the unemployed, 
relative to whites with comparable schooling, age, family composition, 
etc. Non-Puerto Rlcan Hispanics of either sex and black women tend to be 
underrepresented among persons not in the labor force; Puerto Ricans of 
either sex, and black men, tend to be overrepresented. 

All things considered, our logit results suggest that ethnicity as 
such does not have a particularly pronounced association with labor force 
status once one holds constant the effects of other supply and demand 
factors such as age, schooling, family composition, and the like* One 
simple way to illustrate this is shown in Table 3. In this table, we 
show how changing the ethnicity of all ethnic groups to white 
non-Hispanic (without changing their age, schooling, etc.) would alter 
the distribution of our total sample by labor force status. As shown 
there, changing the ethnicity of all persons in our sample to white would 
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Comparison of Minorities* Predicted Unemployment 
and Not-in-Labor-Porce Proportions 
(Predictions use logit coefficients from the sample of whites) 









Unemployed 




Not 


in Labor Force 


Actual 
Z 


Predicted 
Z 


Z 

Diff. 


Actual 
Z 


* ^ C M ^ W k C U 

z 


X 

Diff. 


Men 
















Hispanic 




5.83 


5.43 


7.4 


18.03 


20.23 


-10.9 


Puerto Rican 




8.92 


7.24 


23.2 


21.51 


17.09 


25.9 


Hispanic non-Puerto 


Rican 


5.43 


5.20 


.2 


17.58 


20.63 


-14.8 


Black 




7.16 


5.20 


37.7 


27.38 


21.64 


26.5 


White 




3.45 


— 


— 


22.54 






Women 
















Hispanic 




4.90 


4.96 


7.5 


52.80 


53.52 


-1.3 


Puerto Rican 




6.37 


4.87 


30.8 


62.74 


49.73 


26.2 


Hispanic non-Puerto 


Rican 


4.70 


4.52 


4.0 


51.44 


54.04 


-4.8 


Black 




7.21 


4.78 


50.8 


41.20 


46.69 


-11.8 


White 




3.32 






51.13 







Note: Data base is the Survey of Income and Education; see text for description of 
analysis • 



5,i 

ERIC 



125 



Table 3 

Comparison of Predicted and Actualy^Labor Force 
Distribution for Entire Sample 
(Predictions use logit coefficients from the sample of whites) 







fN • 10.025^ 




(N - 11.361) 




Actual 
Z 


If White, 
Predicted Z 


Actual 
Z 


If White, 
Predicted Z 


Labor Force Status 










Employed 
In' federal sector 
In non^federal sector 


72.86 
4.40 
68.46 


73.94 
3.57 
70.37 


45.78 
2.09 
43.69 


44.89 
1.53 
43.36 




5.22 


4.62 


4.86 


4.14 


Not in labor force 


21.92 


21.44 


49.36 


50.97 


Ethnicity 










Hispanic 
Puerto Rican 
Other 


38.51 
4.36 
34.15 




37.91 
4.56 
33.35 




Black 


22.99 




24.16 




White 


38.49 




37.93 





Note: Data base is the Survey of Income and Education; see text for 
description of analysis. 
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produce rather small shifts in the distribution of our total sample by 
labor force status. For example, the proportion unemployed among men 
would fall about 0.6 of a percentage point, while the proportion unemployed 
among women would fall by about 0.7 of a percentage point. (Recall, 
also, that our total sample for each sex consists of roughly equal num- 
bers of Hispanica and white non-Hlspanics , with somewhat smaller numbers 
of black non-His panics ♦ Thus, minorities are substantially overrepre- 
sented in our sample relative to their representation in the population — 
meaning that any changes of the kind shown in Table 3 would be much 
smaller in the actual population than they are in our sample.) 

DIRECT REGRESSION RESULTS FROM THE SIE DATA 

In this section we discuss our direct (conventional least squares) 
regression results on ethnic pay differences, taking each sex in turn. 
(See J. Abowd and Killingsworth, 1981, for detailed tables; A. Abowd, 
1982, for alternative specifications.) 

Results for Men 

1. Magnitudes . The pay differential for a given ethnic group rela- 
tive to comparable white noa-Hlspanics varies considerably by ethnic 
group (and, as noted beloi*, to a lesser extent by sector). All differen- 
tials are negative, implying that minority ethnic groups tend to be paid 
less than whites who are otherwise comparable (in terms of the other 
variables in the regression model from which the differential is 
derived). They are largest in absolute value (between about -.14 to 
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-.15) for black non-Hlspanlcs, smallest in absolute value (between about 
-•01 to -.05) for non-Puerto Rican Hlspanlcs, and of intermediate size 
(between about -.07 to -.13) for Puerto Rlcans. 

2* Statistical significance . As measured by t-statistics , the sta- 
tistical significance of these pay differentials is generally quite 
substantial for blacks (t-ratios for most black-white differentials are 
between about 5.9 and 9.9); t-ratios for most Puerto Rican-white dif- 
ferentials are considerably lower (between about 0.8 and 2.2). Most dif- 
ferentials between non-Puerto Rlcan Hispanics and comparable whites would 
not be judged statistically different from zero at conventional levols 
(t-ratios for most of these differentials are between about 0.3 and 1.5). 

3. Sectoral patterns . For all three m'inority ethnic groups, 
minority-white differentials in yages are larger in absolute value (that 
is, more negative) in the federal sector than in the non-federal sector, 
while minority-white differentials in earnings are larger in the non- 
federal sector than in the federal sector. For example, the black-white 
wage differential in the federal versus non-federal sector is about -.16 
to -.18 (-.14), while the comparable figure for the earnings differential 
in the federal versus non-federal sector is about -.16 to -.17 (-.25). 

Alternative dependent variables . For all three minority groups, 
the wage differential is larger than the earnings differential in the 
federal sector, but smaller than the earnings differen^^ Ln the non- 
federal sector. For exam ple, for Puerto Ricans, the ws^ge (Learnings) dif- 
ferential is about -.12 to -.13 (-.08 to -.10) in the federal sector, 
while in the non-federal sector the wage (earnings) differential is about 
-.07 to -.08 (-.13). 
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5» Alternative models * For all three minority groups, estimates of 
a given differential are relatively robust with respect to alternative 
models (that is, use of alternative sets of independent variables)* For 
er.ample, regression estimates of the Puerto Rican-vbite differential are 
about -.OS to '•IZ when population proportion variables are not included 
ai:d are about -".lO to ".13 %rhen such variables are included among these 
regressors* (Changes in differentials for most other race/ethnic groups 
attendant upon inclusion of these variables are smaller still.) 

Results for Women 

1« Magnitudes • With a few exceptions, minority-white pay differen- 
tials among women are smaller than among men, and many are either posi- 
tive (implying that certain groups of minority women are paid more than 
comparable white wmen) or else essentially zero, in a statistical sense 
The black-white pay differential among women is about .05 to -.05; the 
Puerto Rican-white female pay differential is about .12 to about -.40; 
and the non-Puerto Rican Hispanic-white pay differential is about .04 to 
-.13. 

2. Statistical significance . On the %*hole, the statistical signifi 
cance of minority-white pay differentials, as measured by their t-ratios 
is lower among women than among men. Black-white differentials among 
women liave t-ratios in the • mge 0.6 to 2.1; Puerto Rican-^' differen 
tials have t-ratios betwe en and 1»6; and non-Puertc <i< Hispanic- 
white differentials have t-rauios between .7 and 2.3. 

3. Sectoral ^patterns . With a few exceptions, minority-white pay 
differentials among women are nx^re negative (that is, lower in absolute 
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value) in the federal than in the non-federal sector. For example, the 
black-white pay differential is about -.04 to -.05 in the federal sector, 
while differentials in the non-federal sector are between about .05 and 
-.05. 

Alternative dependent variables . Black-white and non-Puerto 
Rican Hispanic-white differentials in wages are typically more negative 
(that is, larger in absolute value if negative, or smaller In absolute 
value if positive) than are earnings differentials; while in the case of 
Puerto Rican-white differentials just the reverse holds. For example, 
the Puerto Rican-white wage differential is about .06 to .12, while che 
earnings differential is about -.01 to -.40; black-white and non-Puerto 
Rican Hispanic-white differentials in wages (earnings) are about -.02 to 
-.13 (.04 to -.09). 

5* Alternative models . For all three minority groups, estimates of 
pay differentials are relatively robust with respect to alternative 
models (that is, use of alternative sets of independent variables). For 
example, regression estimates of the Puerto Rican-white wage (earnings) 
differential are about .06 to .12 (-.01 to -.40) when population propor- 
tion variables are not included, and about .06 to .14 (-.01 to -.39) when 
such variables are included among the regreasors in a given model. 

REVERSE REGRESSION RESULTS FROM THE SIE DATA 

In this «;ec-icn we present the results of our reverse regression 
analysis for each sex. (See J. Abowd and Killingsworth, 1981, for 
detailed tables and A. Abowd, 1982, for a discussion of alternative 
specifications.) 
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Results for Men 

!• Magnitudes > The pay differential for a given ethnic group rela- 
tive to comparable %rtiite non-Hispanics varies considerably by ethnic 
group (and, as noted below, to i lesser extent by sector). Unlike the 
direct regression differentials, most of which are negative (implying 
that minorities tend to receive Uf.^^r pay than comparable whites), most 
of the reverse regression differentials are positive (implying that 
minorities tend to receive higher pay than comparable whites). The 
black-white differential is between about .06 and -.05; the non-Puerto 
Rican Hispanic-white differential is between about .14 and .02; and the 
Puerto Rican-white differential is between about .06 and -.01. 

2. Statistical significance . As measured by their t-statistics , the 
statistical significance of these pay differentials is generally quite 
substantial for non-Puerto Rican Hispanics (t-statisti '•.s for this group 
are between about 3.7 and 5.9). Black-white differentials in the federal 
sector, and Puerto Rican-white differentials in the non-federal sector, 
also have relatively high t-ratios (between about 3.4 and 4.2, and be- 
tween about 1.5 and 3.1, respectively). However, black-white differen- 
tials in the non-federal sector and Puerto Rican-white differentials in 
the federal sector would not generally be judged different from zero, in 
a statistical sense, at conventional levels of significance. 

3. Sectoral patterns . The magnitudes and even signs of these dif- 
ferentials vary considerably by sector. Puerto Rican-white differentials 
are always smaller in algebraic value (either negative, or else positive 
but small) in the federal sector than in the non-federal sector (the 
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range of federal sector differentials is about ,02 to -•01, while the 
non-federal sector differential is about .06). On the other hand, dif- 
ferentials between non-Puerto Rican Hispanics and comparable whites in 
the non-federal sector (which are in the range ,07 to ,02) are smaller 
than the differentials in the federal sector (which are in the range #14 
to .09). Finally, introducing population proportion variables changes 
completely the sectoral pattern of the black-white differentials. In 
models in which these variables are not included, the black-white dif- 
ferential in the federal versus non-federal sector is -.0^ to -.OS (.02 
to .01), but when such variables are included the differential in the 
federal versus non-federal sector is between about .04 and ,06 (.01 and 



4. Alternative dependent variables > For all three minority groups, 
the wage differential is about the 'lame as the earnings differential both 
in the federal and in the non-federal sector* For example, foo Puerto 



Ricans, the wage (earnings) differential is about •02 to .01 (.00 to 
-•01) in the federal sector, while in the non-federal sector the wage and 
earnings differentials are both about •06 and •OS^ 

5^ Alternat.^ve models • For Puerto Ricans and other Hispanics, esti- 
mates of differentials are relatively robust with respect to alternative 
models (that is , use of alterative sets of independent variables ) • On 
the other hand, the federal black-white differential seems to be fairly 
sensitive to inclusion of population proportion variables^ When such 
variables are excluded, the federal (non-federal) black-white pay dif- 
ferential is between about -.04 and -•OS (.02 and .01)^ and when such 
variables are included , the differential is between about .06 and .04 
(.01 and .00). 
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Women 

1- Magnitudes * Minority-white pay differentials among women exhibit 
lew obvious patterns; very roughly speaking, there appear to be about as 
many positive differentials (implying that minority women are peiid more 
than comparable white women) as negative differentials (implying that 
minority women are paid less than comparable white women), and a large 
number do not appear to be different from zero (in a statistical sense) 
at conventional levels of significance ♦ The black'-white pay differential 
is between about .04 and -.l?; the Puerto Rican-white pay differential is 
between about .14 and -.11; t'.e non-Puerto Rlcan Hispanic-white pay dif- 
ferential is between about .08 and -.01. 

2. Statistical significance . On the whole, the st#*tistical 
significance of minority-white pay differentials, as measured by th^ir t- 
ratios, is lower among vTomen than among men. Black-white differentials 
ainong women have t-ratioa in the range 1.1 to 8.6; Puerto Rican-^hite 
differentials have t-ratios between .4 and .5.1; and non-Puerto Rican 
Hispanic-white differentials have t-ratlos between .2 and 7.6. 

3. Sectoral patterns * With a few exceptions, minority-white pay 
differentials among women are lower in algebraic value (that is, larger 
in absolute value if negative, and smaller if positive) in the federal 
than in the non-federal sector. For example, the black-white pay dif- 
ferential is about -.02 to -.17 in the federal sector, while differen- 
tials in the non-federal sector ate between about .04 and .02. 

4. Alternative dependent variables . Black-white and Puerto Rican- 
white differentials in earnings are typically more negative (that is. 
larger in absolute value if negative, or smaller in absolute value if 
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positive) than are wage differentials. For example, the Puerto Rican- 
white wage differential is about .14 to .03, while the earnings differen- 
tial is about -.04 to -.11. On the other hand, non-Puerto Rican 
Hispanic-white differentials in wages are greater in algebraic value in 
the federal sector, and; are smaller in the non-federal sector, than are 
non-Puerto Rican Hispanic-white differentials in earnings. 

5» Alternative models ^ For all three minority groups, estimates of 
pay differentials seem fairly robust with respect to alternative models 
(that is, use of alternative sets of independent variables). For example, 
regression estimates of the Puerto Rican-white wage (earnings) differen- 
tial are about .14 to .03 (-.04 to -.11) when population proportion 
variables are not included, md about .13 to .05 (-.05 to -.08) when such 
variables are included among the regressors in a given model. 

STRUCTURAL REGRESSION RESULTS FROM TIIE SIE DATA 

We now discuss our structural (instrumental variable) regression 
results for each sex. (See J. Abowd and Killingsworth, 1981, for 
detailed tables, and A. Abowd, 1982, for a discussion of alternative spe- 
cifications .) 

Results for Men 

1. Magnitudes . The pay differential for a given ethnic group rela- 
tive to comparable white non-Hispanics varies considerably by ethnic 
grojip (and, as noted below, to a lesser extent by sector). la most 
cases., these differentials are negative (implying that minority groups 
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are paid less than comparable whites), and many of them are quite close 
to the corresponding direct vage regression differential* (We say more 
about this below. ) Differentials are largest In absolute value (between 
about -.14 to '-•25) for black non-Hlspanlcs, smallest In absolute value 
(between about --Ol to -.05) for non-Puerto Rlcan Hlspanlcs, and of 
Intermediate size (between about -.07 to -,14) for Puerto Rlcans» 

2* Statistical significance . As measured by their t-statlstics , the 
statistical significance of these pay differentials Is generally quite 
substantial for blacks (t-ratlos for most black-white differentials are 
between about 6*2 and 10.4); t-ratloB for most Puerto Rlcan-vhlte dif- 
ferentials are considerably lower (between about 1.0 and 2.1). Most dif- 
ferentials between non-Puerto Rlcan Hlspanlcs and comparable whites would 
not be Judged statistically different from zero at conventional levels 
(t-ratlo8 for most of these differentials are between abou^ 0.3 and I. 5). 

3. Sectoral patterns * For all three minority ethnic groups » 
minority-white differentials In wages are larger absolute value (that 
Is, more negative) in the federal sector than in the non-federal sector, 
while minority-white differentials in earnings are larger in the non- 
federal sector than in the federal sector. For example, the black-white 
wage differential in the federal versus non-federal sector is about -.18 
to -.20 (-.14), while the comparable figure for the earnings differential 
in the federal versus non-federal sector is about -.17 to -.19 (-.25). 

4. Alternative dependent variables > For all three minority groups, 
the wage differential is larger than the earnings differential in the 
federal sector, but smaller than the earnings differential in the non- 
federal sector. For example, for Puerto Ricans, the wage (eax-nings) 
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differential is about -.14 (-.11) In the federal sector, while In the 
non-federal sector the wage (earnings) differential Is about -.07 to -.08 
(-.13). 

5. Alternative models . For all three minority groups, estimates of 
a given differential are r^^latlvely robust with respect to alternative 
models (that is, use of alternative sets of Independent variables). For 
example, regression estimates of the Puerto Rican-white differential are 
about -.08 to -.14 when population proportion variables are not included 
and are about -.07 to -.14 when such variables are Included among the 
regressors. 

Results for Women 

1- Magnitudes . With a few e3^:ceptions , minority-white pay differen- 
tials among women ar^ smaller than among men; most are fairly similar to 
the corresponding direct wage regression estimate; many are essentially 
zero, in a statistical sense* The black-white pay differential among 
women is about .06 to -.09; the Puerto Rican-white female pay differen- 
tial is about .29 to -.53; and the non-Puerto Rlcan Hispanic-white pay 
differential is about .03 to -.36. 

2. Statistical significance . On the whole, the statistical signi- 
ficance of minority-white pay differentials, as measured by their t- 
ratlos, is lower among women than among men. Black-white differentials 
among women have t-ratlos in the range 0.1 to 1.7; Puerto Rican-white 
differentials have t-ratlos between 0.2 and 0.9; and non-Puerto Rican 
Hispanic-white differentials have t-ratlos between 0.4 and 2.0. 
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3. Sectoral patterns * Black-white and non-Puerto Rlcan Hispanic- 
white pay differentials among women are usxxally somewhat more negative 
(that is, lower in absolute value) in the federal than in the non-federal 
sector. For example, the black-white differential is about -•OO to -.09 
in the federal sector, while differentials in the non-federal sector are 
between .06 and -.02. Finally, the Puerto Rican-white differential is 
always larger in absolute value in the federal sector than it is in the 
non-federal sector— but the estimated wage differentials imply that 
Puerto Ricans are paid more than comparable whites, particularly in the 
federal sector, while the estimated earnings differentials imply that 
Puerto Ricans are paid less than comparable whites, especially in the 
federal sector. (See below.) 

4. Alternative dependent variables . Black-white and non-Puerto 
Rican Hispanic-white differentials in wages are typically more negative 
(that is, larger in absolute value if negative, or smaller in absolute 
value if positive) than are earnings differentials; while in the case of 
Puerto Rican-white wage differentials just the reverse holds. For 
example, the Puerto Rican-white wage differential is about .25 to .29 in 
the federal sector (vs. about .05 in the non-federal sector), while the 
differential in earnings in the federal sector is about -.39 to -.53 
(vs. -.02 to -.03 in the non-federal sector). 

5. Alternative models * For all three minority groups, estimates of 
pay differentials are fairly robust with respect to alternative models 
(that is, use of alternative sets of independent variables). For example, 
regression estimates of the black-white wage (earnings) differential are 
about .05 to .29 (-.02 to -.53) when population proportion variables are 
not included and about .05 to .25 (-.03 to .39) when such variables are 
included among the regressors in a given model. 
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DIRECT AND REVERSE WAGE REGRESSION RESULTS FROM THE CPDF DATA 

In this section we discuss direct and reverse regression results 
derived from the federal government CPDF data. 

1. Results by race/ethni city and sex . In general, the CPDF results 
seem fairly similar to the SIE results as regards racial and ethnic pay 
differentials by sex within the federal sector. As in the SIE results, 
the CPDF results Imply that both Hispanica and blacks are paid less 
within the federal sector than are whites (that is, either non-black 
non-Hlspanics, including American Indians and Orientals as well as 
majority whites; or majority whites as such). In general, black-white 
pay differentials in the CPDF results are larger in absolute value than 
Hispanic-white pay differentials; and, for either racial-ethnic group, 
the minority-white differential among men is larger than the minority- 
white differential among women. Most of the CPDF differentials are sta- 
tistically different from zero at reasonable levels of significance. 

2* Results by type of statistical model . In our CPDF results, as in 
our SIE results, reverse wage regression generally produces estimates of 
differentials that are less negative than those derived using direct wage 
regression; indeed, in several Instances (notably for Hlspanics), the 
direct wage regression estimate of the minority-white differential has 
a negative sign (implying that minority persons are paid less than com- 
parable whites), but the reverse wage regression estimate is positive 
(implying that minority persons are paid more than comparable whites). 
Black women are an exception to this generalization, however; in some 
cases, the reverse wage regression estimate of the black-white differen- 
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tial for women is slightly more negative than the corresponding direct 
wage regression estimate* ?inall7> the shrinkage in the estimated dif- 
ferential (that is, the extent to which use of reverse wage regression 
makes a given differential less negative) seems> in general, to be 
smaller in the CPDF data than in the SIE data. 

^* Comparison with results derived from the SIE > On the whole, both 
the direct an^' reverse wage regression estimates of the black-white dif- 
ferential derived from the CPDF are similar to the direct and reverse 
wage regression estimates of this differential derived from the SI£« 
(However, the CPDF direct wage regression black-white differentials among 
men seem somewhat smaller, in absolute value, than the corresponding SIE 
estimates.) On the other hand, the CPDF estimates of the Hispanic-white 
differential seem, in general, to be somewhat closer to zero (either 
smaller if positive, or less negative, if negative) than the 
corresponding SIE estimates. However, the differences between the SIE 
and CPDF estimates do not, in general, seem particularly large. 

COMPARISON OF ALTERNATIVE ESTIMATORS AND RESULTS 

We now consider the alternative estimation techniques that we have 
used in evaluating the determinants of pay. We do this using our pre- 
ferred results from the SIE for men and for women, which we set out in 
Tables 4 and 5, respectively. These results are all evaluated at the 
mean values of all variables for white non-Hispanics and are derived from 
either our basic regression model (in which case they are labeled 
"without population proportions") or from our detailed regression model 
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with population proportions but without three-way interactions (in which 
case they are labeled "with population proportions"). 

In drawing conclusions about our three different estimation tech- 
niques from Tables 4 and 5, it is worth recalling that these techniques 
are concerned with different statistical end conceptual issues. First, 
structural regression is concerned with estimating the answer to the 
first methodological question; that is, with estimating differences in 
employer wage offers. It does not, however, make a correction for 
possible measurement error bias. Second, both direct and reverse wage 
regressions are concerned with estimating the answer to the second method- 
ological question; that is, with estimating differences in compensation 
conditional on employment. Direct regression does not make a correction 
for possible measurement error bias while reverse regression does make a 
correction of this kind. Thus, it would be reasonable to expect that 
these three different techniques would produce different results. The 
key issue is, of course, the extent to which results derived from these 
techniques do in fact differ. 

As before, it seems advisable to consider each sex separately. As 
regards men, it is evident from Table 4 that the reverse regression dif- 
ferentials contrast sharply with both the direct and the structural 
regression differentials: differentials estimated using either of the 
latter two techniques are usually negative (and often significantly dif- 
ferent from zero, in a statistical sense), while differentials estimated 
using the former technique are frequently positive. As regards the 
federal sector, both structural and direct regression differentials are 
negative, but the latter, are usually somewhat smaller in absolute value 
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Table 4 

Summary of Direct, Reverse, and Structural Wage Regression Estimates of 
Ethnic Differentials in Pay for Men Evaluated at Mean Values of Whites 







Federal Sector 




Non-Federal Sector 






Hispanics of 
Puerto Rlcan 
Origin 


Hispanic 
non-Puerto 
Rlcans 


Blacks 


Hispanics of 
Puerto Rican 
Origin 


Hispanic 
non — Pue r t o 
Rlcans 


Blacks 


A* Without Population Proportion Variables 










X • ijOg wages 
direct 


-.1241 
( .0820) 




-.0466 
(.0321) 


-.1789 
(.0183) 


-.0799 
(.0481) 


-.0265 
( .0232) 


-.1426 
(.0192) 


reverse 


.0201 
(.0497) 




.1097 
(.0186) 


-.0471 
(.0111) 


.0619 
( .0201) 


.0509 
f .0085) 


.0215 
(.0897) 


structural 


-.1413 
( .0856) 




-.0513 
(.0342) 


-.1987 
(.0191) 


-.0783 
( .0482) 


-.0257 

( 0232'i 


-.1409 
(.0191) 


2. Log Earning 
direct 


s 

-.0857 
(.1072) 




-.0186 
(.0419) 


-.1650 
(.0238) 


-.1340 
(.0628) 


-.0441 
(.0303) 


-.2476 
(.2503) 


reverse 


.0042 
( .0682) 




.1362 
(.0255) 


-.0517 
(.0151) 


.0479 
(.0323) 


. UHU.J 

(.0137) 


.0133 
(.0145) 


structural 


-.1083 
(.1118) 




-.0404 
(.0447) 


-.1887 
(.0250) 


(.0629) 


(.0303) 


-.2472 
(.0250) 


B. With Population Proportion Variables 










1 . Log Wages 
direct 


-.1279 
(.0821) 




-.0476 
(.0337) 


-.1612 
(.0204) 


-.0697 
( .0482) 


-.0080 
( .0241) 


-.1420 
(.0202) 


reverse 


.0056 
( .0521) 




.0945 
(.0195) 


.0390 
(.0116) 


.0818 
( .0211) 


.0682 
( .00901 


.0093 
(.0095) 


structural 


-.1444 
( .0856) 




-.0503 
( .0356) 


-.1750 
(.0214) 


-.0676 
( .0482) 


-.0070 
(,0241) 


-.1386 
(.0202) 


2< Log Earning 
direct 


s 

-.0965 
( .1073) 




-.0108 
(.0440) 


-.1564 
(.0267) 


-.1304 
(.0630) 


-.0443 
(.0315) 


-.2497 
(.0264) 


reverse 


-.0140 
(.0705) 




.1083 
(.0264) 


.0584 
(.0156) 


.0587 
(.0325) 


.0235 
(.0137) 


.0030 
(.0146) 


structural 


-.1134 
(.1118) 




-.0278 
(.0465) 


-.1726 
(.0280) 


-.1299 
(.0631) 


-.0438 
(.0315) 


-.2475 
(.0264) 



Mote: Data base is the Survey of Income and Education; standard errors are in parentheses. 
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Table 5 



Summary of Direct, Reverse, and Structural Wage Regression Estimates of 
Ethnic Differentials in Pay for Women Evaluated at Mean Values of Whites 





Federal Sector 




Non-Federal Sector 






Hispanics of 
Puerto Rlcan 
Origin 


Hispanic 
non-Puerto 
Ricans 


Blacks 


Hispanics of 
Puerto Rlcan 
Origin 


Hisoanic 
non-Puerto 
Ricans 


Blacks 


A. Without Population Proportions 












1 • Log Wages 
direct 


.1209 
( .1576) 


-.1268 
(.0567) 


-.0501 
( .0236) 


-.0590 
(.0619) 


-.0267 
(.0262) 


-.0119 
(.0189) 


reverse 


.0320 
( .0765) 


.0127 

(.026A) 


-.0512 
(.0110) 


.1398 
(.0276) 


(.0183) 


.0329 
(.0095) 


structural 


.2943 
(.3268) 


-.3615 
(.1784) 


-.0025 
(.0387) 


.0472 
(.0621) 


-.0290 
(.0263) 


-.0147 
(.0189) 


2 . Log Earnings 
direct 


"•3962 v> > iij^- ' " "^ ■< 
(.2522) 


-.0327 
(.0907) 


(.0377) 


-.0075 
(.1074) 


.0350 
(.0454) 


.0537 
(.0327) 


reverse 


-.1133 
(.1340) 


.0574 
(.0462) 


-.1665 
(.0194) 


-.0394 
(.0258) 


.0176 
(.0196) 


.0444 
(.0182) 


structural 


-.5281 
( .5908) 


.1889 
(.3224) 


-.0945 
(.0699) 


-.0185 

(.1077) 


.0332 
(.0456) 


.0563 
(.0329) 


B. With Population Proportions 












1 • Log Wages 
direct 


.1363 
(.1575) 


-.1328 
(.0586) 


-.0485 
(.0263) 


.0622 
(.0621) 


-.0199 
(.0273) 


-.0149 
(.0195) 


reverse 


.0548 
( .0820) 


-.0050 
(.0283) 


-.0205 
(.0118) 


.1349 
(.0281) 


.0802 
(.0105) 


.0174 
(.0097) 


Gtructural 


.2474 
(.3264) 


-.3516 
(.1759) 


-.0045 
(.0381) 


.0508 
(.0622) 


(.0273) 


-.0173 
(.0195) 


2 • Log Earnings 
direct 


-.3997 
( .2523) 


-.0911 
(.0938) 


-.0507 
(.0421) 


-.0144 
(.1077) 


.0350 
(.0473) 


.0402 
(.0338) 


reverse 


-.0754 
(.1392) 


.0208 
(.0480) 


-.1228 
(.0202) 


-.0513 
(.0548) 


.0581 
(.0203) 


.0216 
(.0189) 


structural 


-.3938 

( .:ioli) 


-.1211 
(.3025) 


-.0112 
(.0655) 


-.0256 
(.1080) 


.0321 
(.0474) 


.0415 
(.0339) 



tote: Data base is the Survey of Income and Education; standard errors are In parenthe 
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than the former. On the other hand, structural and direct regression 
differentials for the non-federal sector, while usually negative, are 
also generally quite close to each other; Indeed, In many Instances, a 
structural wage regression differential for the non- federal sector Is 
usually slightly smaller than Its direct wage regression counterpart, 
although the difference Is generally very small. Finally, In n»st 
Instances (particularly as regards the federal sector), t-ratlos for 
structural wage regression differentials are somewhat larger than t- 
ratlos for their direct wage regression counterparts: standard errors of 
estimated structural wage regression differentials are slightly larger 
than standard errors of estimated direct mge regression differentials, 
but the estimates themselves are larger still, particularly for the 
federal sector. 

While Table 4 thus suggests a variety of generalizations concerning 
the impact of using alternative estimation t chnlques as far as estimates 
for men are concerned. Table 5, for wmen, suggests little In the way of 
patterns or stylized facts. The three estimation techniques, applied to 
the federal sector, seem to produce three rather different sets of esti- 
mated ethnic differentials among women. Estimates for the non-federal 
sector derived using the three techniques seem, on the whole and roughly 
speaking, to be somewhat closer together. However, in many cases— and 
to a much greater extent than is true of our results for men — the dif- 
ferentials for women reported in Table 5 would not be judged different 
from zero, at conventional levels of significance, regardless of the tech- 
nique used in estimating them. In this sense, then, the results of these 
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different estimation techniques are closer together than cursory Inspec- 
tion of Table 5 might suggest. 

Table 6 compares the results obtained from both the SIE and the CPDF 
for the year 1975. For the two estimation techniques considered, direct 
and reverse, the results from these data sources are quite similar. 
Essentially the same Inferences are supported In either data set. 



SUMMARY AND CONCLUSIONS 

There Is not much consistent or compelling evidence In our results to 
suggest that minority women generally suffer substantial wage discrimina- 
tion (In either the Question 1 or Question 2 sense) relative to com- 
parable white women. One possible exception to this statement concerns 
black women In the federal sector, where our results usually show negatlvi 
pay differentials. (However, a considerable number of these differen- 
tials do not differ froa zero, in a statistical sense, at reasonable 
levels of significance.) An important caveat in this respect la that our 
data do not contain measures cf actual ^rk experience (Garvey and 
Reimers, 1980). We are, therefore, forced to use a proxy, potential 
experience. 

Second, as regards ethnic differentials in pay among men, our results 
suggest (a) that minority men may suffer discrimination both in terms of 
conditional differentials and in terms of offers, and (b) that estimates 
of the magnitudes of both kinds of discrimination may be subject to 
serious measurement error bias. Part (a) of this conclusion follows in a 
straightforward way from consideration of our direct and structural wage 
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Table 6 

Comparison of Ethnic Pay Differentials for Men and Women 
for 1975 Derived from SIE and CPDP Data 







Men 




Women 




<5TF 
3 JLCi 


















direct 


-.0558 
(.0304) 


-.0543 
(.0080) 


-.1020 
(.0542) 


-.0134 
(.01] 4) 


reverse 


.0993 
(.0176) 


.0283 
(.0062) 


.0146 
(.0251) 


-.0017 
(.0107) 


Blacks 










direct 


-.1787 
(.0182) 


-.1381 
(.0130) 


-.0503 
(.0236) 


-.0603 
(.0151) 


reverse 


-.0471 
(.0111) 


-.0421 
(.0110) 


-.0512 
(.0110) 


-.0441 
(.0147) 



Notes: Standard errors are in parentheaes. SIE coluflins present 

regression differentials derived from the Survey of Income and 
Education for men and women in the federal sector; dependent 
variable « natural logarithm of hourly wages* CPDF columns present 
regression differentials derived from the federal Central Personnel 
Data File; dependent variable « natural logarithm of annualized 
salary. 
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regression results; note that our results provide ouch stronger support 
(in the sense of statistical significance) for this proposition with 
respect to blacks than with respect to Puerto Rican or other Hlspanics. 
Part (b) of this conclusion is prompted by our reverse wage regression 
results. 

Third, our results also suggest that wage discrimination against 
minority males (particularly blacks) is greater m the federal than in 
the non-federal sector, while earnings discrimination against minority 
males (particularly blacks) is smaller in the federal than in the non- 
federal sector. At first sight, this may seem paradoxical: if the non- 
federal sector is better than the federal sector as regards wage discrl- 
minaiiion. why isn't it also better as regards earnings discrimination? 
One possible explanation of this apparent paradox has to do with 
employment instability, which is greater in the non-federal sector than 
in the federal sector: if minorities suffer substantially and dispropor- 
tionately (relative to comparable whites) from the relatively greater 
employment instability (layoffs, etc.) in the non-federal sector, then 
the non-federal sector could well be worse than the federal sector as 
regards earnings differentials even if it is better as regards wagas. 
Our logit results on labor force status appear to suggest that minority 
groups generally are overrepresanted among the unemployed. While this 
finding does not prove the validity of our conjecture about sectoral pat- 
terns in wage vs. earnings differentials, it is certainly consistent with 
It. 

Of course, the notion that discrimination within the federal sector 
may be substantial is not new. Our results not only support this view 
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but nilso suggest: t^omethlng else: discrimination against minority males, 
particularly in terms of wages and with respect to blacks, is of greater 
magnitude in the federal than in the non-federal sector* This is par- 
ticularly noteworthy because previous studies have tended to suggest Just 
the opposite* We su£^(pect that one reason for this is that, in contrast 
with previous work, we have attempted to control in *a fairly detailed 
fashion for purely geographic effects on pay (via differences in the cost 
of living and the like)* Since minorities are generally overrepresented 
in federal employment, and since ti\uch federal employment is concentrated 
in urban areas in particular states, sorting out purely geographic 
effects on pay (in effect, purely compensating or equalizing premia) from 
other kinds of effects, including ethnicity, obviously need not be a tri- 
vial matter* Indeed, the difference between our results and those found 
in previous work suggests that such effects may be Important* 
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NOTES 



^Studies that attempt to decompose earnings differentials into por- 
tions attributable to employer discrimination and portions attributable 
to differences In productivity characteristics such as education Include, 
among others. Blinder (1973), Oaxaca (1973), and Smith (IS , 
Litigation under Title VII of the Civil Rights Act and other antidiscri- 
mination laws and regulations Is implicitly or explicitly concerned with 
the extent to which observed employment and earnings differences between 
sexes or between racial or ethnic groups are attributable to employer 
discrimination per se rather than to other factors such as differences In 
productivity-related characteristics. Analyses of earnings differences 
In the context of legal proceedings Include Baldus and Cole (1980), 
Ehrenberg (1979), and Flnkelsteln (1980). 

2one Important reason for studying employment and earnings differen- 
ces b/ sector Is that such differences may reveal the extent to which a 
particular sector is unusual compared to the rest of the economy. (For 
example, see Smith, 1977.) A second reason Is that nonpecunlary rewards 
to employment may vary by sector: for example, federal government 
employment may entail greater job security or better working conditions 
than employment elsewhere In the economy (Smith, 1977). We define wage 
discrimination as a differential In the total reward to employment, 
Including both pecuniary and nonpecunlary rewards. This reinforces the 
usefulness of an intrasectoral analysis of wage discrimination since 
Important differences in nonpecunlary compensation across sectors are. In 
effect, held constant. On the other hand, the fact that such an analysis 
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may have conceptual advantages over an intersectoral study does not 
necessarily mean that statistical procedures suitable for the latter kind 
of study are also suitable for the former kind of study. 
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